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ABSTRACT 


Iverson's original API differs substantially in some 
respects from implemented versions. In the course of 
implementing APL deficiencies have crept into the language 
in areas such as subscripting, branching, mixed operators 
and functions. The fcrmer part of this work is a review of 


these deficiencies as well as proposals te correct them. 


The latter part of the thesis examines Abrams' work on 
an APL machine and proposes changes tc it. The APL machine 
is a stack-oriented machine which can simplify APL 
expressions and defer their execution. Simplification is 
accomplished by means of transformations to the polynomial 
access functions of arrays. Deferral allows simplification 
to be carried out over larger expressions as well as 
optimizing the use of low speed core memory. The APL 
machine is more efficient than current implementations which 
immediately apply operators to their operands creating 


temporary results in low speed memory. 
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CHAPTER I 


SS ae 0 ee ee 


The design and implementation of programming languages, 
such as AFL({A Programming Language), is a compromise between 
design goals and pragmatic considerations in the 
implementation. Experience in implementing a language 
affects changes in the design of that language as well as 
influencing the design of new languages. This feedback 
mechanism iS a major component in the development of 
programming languages. In discussing design criteria 
Iverson{ 14] states that a programming language should be: 

"concise, precise, consistent over a wide area 

of application, mnemonic, and economical of 

symbols; it shcuid exhibit clearly ccnstraints 

on the sequence in which operations are 

performed; and it should permit the 

description of a process to be independent of 

the particular representation chosen for the 

data." 
Through the several implementations by L. M. Breed(8], 
P. S. Abrams[1], P. Berry[5], and others, many concessions 
have been made to ease the task of implementation. Although 


the results are readily recognizable as APL the design 
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criteria set forth by Iverson have nct been rigorously 
applied to the alterations made. We will examine the 
language and implementation as described ky the APL/360 
User's Manual{12]. It is assumed that the reader has a 
reasonable working knowledge of the APL language and 
programming. Appendix A contains a summary of the operators 


in the language. 


The utility of a programming language is dependent to a 
Significant degree on the consistency of its definition. 
APL is a simple and consistent language that has acquired 
some flaws in the precess of implementing. Chapter II is 
concerned only with criticizing some of the inconsistencies 
of the implemented language. The criticisms deal with 
inconsistencies in the language such as ccontext-dependent 
operators, differences between functions and operators, 
operators which can be usefully extended to arrays and the 
branch operator which violates several of the design 


criteria mentioned by Iverson. 


By applying the design criteria more rigorously the 
power and generality of the language may be increased. 
Chapter III attempts to do this. A new data type along with 
changes to the syntax and definition of the language is 
shown to eliminate some of the problems described above. A 


hew operator - Rename - is proposed to make the additions to 
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the language more widely applicable. an I/0 facility for 
sequential files is included and made to resemble the 
present use of variables. Iverson's suggestion that the 
sequence cf operations should be clear is used as 
justification for eliminating the branch operator and 
replacing it with the syntax IF, THEN, ELSE and REPEAT, END. 
The motivations for the changes in Chapter III are increased 
consistency and the development of a more powerful and 


general programming language. 


One of the concessions made to ease the task of 
implementation was the altering of the syntax of Iverson's 
APL so that the language became essentially a language of 
operators. In Chapter IV a proposal for an implementation 
by Abrams is examined. The proposal takes advantage of the 
operator nature of the language. Abrams[1] shows that scme 
expressions in the language can be simplified so that fewer 
operations need to be done to execute an expression. In 
addition all expressions in the language can be executed 
more rapidly if the execution is organized so that slow- 
speed memory is referenced as seldom as possible. Abrams' 
proposal is described in terms of a machine with three 
parts. The first part is a pseudo-compiler, the second an 
algebraic simplifier, and the third an executor. The 


pseudo-compiler generates code for the algebraic simplifier 
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from APL text. The algebraic simplifier interprets its code 
to produce simplified code for the executcr. The simplifier 
uses the stack-oriented nature of Abrams machine to 

generate executor code that will reference slow-speed memory 


efficiently. 


The modifications proposed to the language attempt to 
retain the simplicity and consistency that APL is known for 
while introducing facilities to make it a more powerful and 
useful language. Abrams' machine and the modifications 
proposed to it form the base for either a software 


implementation or a hardware design. 
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CHAPTER II 


AFL is usually described as an array-oriented language, 
although it could just as well be characterized asa 
language of Operatcrs. These two aspects are fundamental to 
the formation of APL expressions. The language is designed 
to achieve the following goals: 

1. To have a simple and explicit syntax for all the 
primitive operators in the language. 

2. To extend in an obvious manner the definition of 
primitive operators to arrays or other data types 
incorporated in the language. 

3. To be as consistent as possible without reducing the 
power of the language. The consistency should be 
such that it enhances the simplicity of the 
language. For example, operands should have 
consistent conformability requirements, functions 
should closely parallel operators in their usage, 
and added features such as new data types or 
improved I/O facility should be included with as 
little change to the framework of the language as 
possible. 


In the remainder of this Chapter the discussion will centre 


Pp 


A a cr | 


svn Pom ss ae . 
r _ saandazage, 20, 
sewage Wap Te dons 


atte on Wwalitseosd. : : 


gag Aid ots qeits = eiqa ts, ¢-ewas ot be o 
a 


+ apaiiy ne £ £° sswte7sgo +i 
fo, cbae ors tel oO ; eee ear a6 gf. t La4ese> oF 
7 ! om 


Ly 


@ a sad igltetad 
ot betubingh 92 axe 9 


ivr geieapees Pus Be a sit 


’ 


23445. % 0) daazo SS “Voaee ow ot0tenaao avisiniwy | . 
amigaat @lz-ne babesnigaooad _ 
ay, uataeses Meher od e Ps) a Suis? 3 1Bi9>; 68 od SE ~e 
oh BipOns yoretebahos Fite Spi guat ‘paz 20° TEE f = 
ait- fo. ytidtidate edz arom Gnne +E tea> sole. _ 
eer! &irarterarntoqo iquex 19% ceed 
acd Pioais ateensat op 34 riek Hbiipi aes *istetenoe 
Ree ae aeea? PL 270 78 iGo letlsasy theeoin tivode a a 
20 pices | Gist Wars adobe Saaudetd bapoe baw - : 
_ 
a 


od ded besietond sil hipiee tae tenes OF coeongad — 


ae Bpeapaal. 3y te tadtendr Shs oy sate ical = 


on the API language - the formation of expressions - as 
distinct from system commands, or function definition 


facilities included in implementations. 


A variable, a number, a niladic function, and a niladic 
operator are the simplest APL expressicns. More complex 
expressions can be formed from monadic and dyadic functions 
and operators with expressions as their operands. Functions 
differ from operatcrs in that they are not predefined in the 
language. This distinction is important in the design of an 
implementation but has relatively little effect in the 
language. Expressions are interpreted right to left unless 
parentheses are used to delimit complex expressions as left 
operands. These few simple rules form the basis for 
constructing APL statements. Reduction and inner and outer 
product are useful features in the language which provide a 
flexible means for forming many new monadic and dyadic 
operators. Although APL is noted for its consistency[ 4] 
there are a few areas which are inconsistent. Some of these 
areas, such as subscripting, functions, specification, 
branching and the definition of mixed operators, are covered 


in the remainder of this Chapter. 
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2.1 Subscripting 


Subscripting, although classified as a mixed operator, 
does not generally allow as an operand an expression that is 
valid in any other context except subscripting. It also may 
take precedence over other operators, a violation of the 
Light to left parsing procedure. These discrepancies are a 
result of defining subscripting in a manner similar to the 
classical form wherein the subscriptee is immediately 
followed by a delimiting pair of brackets which contain the 
indices. The indices are separated ty delimiters such as 


semi-colons. 


AFL has generalized the classical definition so that 
the subscriptee may be a variable, a niladic function that 
returns a result, or an expression. The individual indices 
must be valid expressicns. The number of indices must be 
equal to the rank cf the subscripted expression. The rank 
of the result is the catenation of the ranks of the indices. 
Thus a vector of length one can be subscripted by an n-rank 


array to produce an n-rank array. 


Despite the generalization of the definition of 
subscripting the inconsistencies incorporated reduce the 
utility of the language. For example, Index is the only 


Operator which reflects the rank of the operand in the 
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Syntax. This inconsistency leads to the following 


difficulty. If a dyadic function is to be written which has 


as a left operand an array of variable rank and dimensions 


and as a right operand a vector which specifies a scalar 


component of the array to be returned as a result, the 


function can be defined as: 


V Z+*ARR SINDEX V 


C1] +1+pV,2+«10 


AO] +0,09Z+ARR[V] 


fs). +0,0pZ2*+ARRCV(1);VC2]) 


C4) +0,092*ARR(VL(1I);V02);V03I) 


[wv] +0,0pZ*ARRLV[1]; bee ee VOW 1 


V 


Examination of SINDEX shows clearly the awkwardness that 


results from the definition of 
of SINDEX could be extended by 


right operand. 


However the definition of 
that it could be used in place 
proklems inherent in extending 


the following examples: 


subscripting. The definition 


allowing an array as the 


SINDEX cannot be extended so 
of the operator Index. The 


SINDEX are demonstrated by 
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Seen A RTA <C. 
In the first case the indexing operation can easily he 
replaced by a functicn. However the second case 
demonstrates that subscripting accepts as operands 
expressions which are valid only in the ccntext of 
subscripting. The third case demonstrates that subscripting 
is really two different operators - if it is an operator - 
which is dependent on the syntactic context for the 
interpretation of its meaning. It can be an operator which 
fetches values from an array or in ccnjunction with 
Specification stores values into an array. The second and 
third examples demcnstrate that subscripting cannot in 
general be replaced by a function. The result is, that when 
proposals are made for alternative definitions of 
subscripting[7], new primitive operators have to be added to 


the language. 


The design of the language should not be such that it 
forces a user to implement primitive oferators in crder to 
reasonably solve his specific problem. Rather, primitive 
operators should be incorporated in the language because of 
heavy use or because they introduce a facility that 


otherwise could not ke obtained. 
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2.2 Functions 


Functions allow the power of the primitive operators to 
be extended(4]. Functions may be classified in six ways. 
They are niladic, monadic or dyadic depending on the number 
of operands and they may or may not return an array as a 
result. A niladic function that returns a result can be 
included in an expression as if it were a variable. Monadic 
and dyadic functions that return results can be considered 
aS Operators in most cases except that present 
implementations do not allow dyadic functions to be used in 


reduction, or inner cr outer products. 


There are two methods by which arguments may be passed 
to an invoked functicn. Two, or fewer, arguments may he 
passed as explicit operands to the invoked function, and a 
result may or may not be returned. The values of the 
explicit operands will not be changed on return from the 
invoked function. This parallels the definition of scalar 
operators. Arguments may also be passed implicitly. Any 
variable which is defined at the point of function 
invocation will be known within that function by the same 
name unless it is declared local to that function or has the 
same name as one of the explicit operands in the function 
header. In order to pass three or more distinct arguments 


to a function there must be a deliberate correspondence 
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between variable names in the called and calling function. 
Inadvertent corresfondences can, on occasion, lead to rather 


subtle interaction between programs. 


Functions and operators differ in the way in which they 
pass arguments. A function's arguments are passed by value 
while an operator's arguments are passed by name{3]. In APL 
the facility for passing arguments by name is different than 
ALGOL's facility as expressions are evaluated before 
operators are invoked and thus expressions cannot be passed 
aS arguments. The essential difference between call by name 
and call by value is that call by name allows access to the 
arguments of a routine as known at the point of invocation 
whereas call be value has duplicated the arguments and these 
new arguments are kncewn only within the invoked routine. As 
a conseguence the Specification operator cannot be replaced 
by a function. Thus, it is open to the same criticism as 
was subscripting. Another aspect of passing arguments by 
value is that expressions can be formed which are valid and 
executable, but because of implementation restraints cannot 
be replaced by a function. For example: 

1. A+TCAT+pB] 
2. AtFUNC B 
The array B can be made sufficiently large so that example 


one will execute and example two will not. The problem in 
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example two is that the array B needs to be allocated twice 
in memory so that it can be passed by value. Although this 
is an implementation restriction it is artifically created 
by the design of the language. The beneficial aspect of 
call by value is that it need not be a matter of concern for 
novice prcegrammers. An implementation designed to serve 
this group would do well to retain this feature. In a more 
general environment the facility and efficiency of call by 


Rame is werth having. 


The definiticr of a function consists of two parts, the 
functicn header and the body. The body consists of numbered 
expressions. In order to pass control from one expression 
to any other expression a branch operator is available. 
Branch is a specialized monadic mixed operator. The 
operator is context-dependent in that it must be the left 
most character in the string defining a given line ina 
function. It is the only primitive operator that does not 
define a result. In larger programs careless use of the 
Branch operator can eliminate any correspcendence between 
flow of execution and physical layout of the program. fhe 
result can be a pregram that is extremely unreadable and 


difficult to debug. 
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2-4 Mixed Qperators 


Two of the goals for APL mentioned at the beginning of 
this Chapter were to provide a simple syntax for operators 
and an obvious extension of the syntax for arrays. The 
definition of the scalar operators achieves this goal quite 
adequately. The definition of mixed operators and functions 
falls short when we consider the constraint of keeping the 
language as consistent as possible. However consistency of 
the language must yield to the power and facility provided 
by the mixed operators. Mixed operators are distinguished 
from scalar operatecrs because of the variety of restrictions 
they place on their operands. For instance, the definition 
of compression or expansion requires a vector or a scalar as 
the left operand. This precludes conformability 
requirements associated with scalar arithmetic operators. 
Some of the mixed operators can be extended so as to reduce 
the restrictions on their operands and provide a more 


powerful facility. 


Encode and Decode can easily be extended to arrays. 
This extension exists in some implementations. Any two 
arrays are conformable for Encode. The resulting dimension 
is the catenation of the dimensions of the operands. The 
original definition of Encode allows a vector as the left 


operand and a scalar as the right. This can be extended so 
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that column vectors from the left operand are applied to the 
scalar elements of the right operand array. If the left 
operand of Encode is an array then Decode can not be used as 
an inverse function. Decode is extended to accept only a 
vector or scalar as the left operand and requires the left 
operand to be confcrmable to the first dimension of the 


right operand. 


Index Generatcr, monadic IOTA, accepts as an operand a 
scalar or one element array. The resulting vector generates 
indices for other vectors or by repeated use will generate 
indices for arrays of higher rank. Like Encode and Decode 
there is a natural extension to Index Generator which allows 
it to generate the indices for an n-dimensional array. The 
operand is a scalar or vector which has values that set the 
upper bound on values in the rows of the result. If the 
operand is a vector V then the result is a ((pV),x/V) 
matrix. The columns of the matrix are the indices of one 
scalar element in the array. The extended operator would be 
equivalent to the follcwing function: 

VY Z+INDEXGEN V 


Ci] Z+((pV),x/V )pLORG+VT(1x/V)-IORG 
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Note that the Index Generator used in the definition of 
INDEXGEN is intended to be the original rather than the 
extended definition. Also IORG represents the value of the 
current origin. The result of INDEXGEN is a vector or a 
crank two matrix. The rank two matrix cannot be used 
directly with the Index operator to obtain the desired 
result but can be of use with an INDEX function defined as: 
VY Z<ARR INDEX IWD 
C1) Z+( ,ARR)CIORG+(pARR)sIND-IORG] 
V 
This is one of the simplest of numerous useful definitions 
for INDEX. Note that INDEX is an example of a function and 


not a suggestion for an extension of a primitive operator. 


Index Of, like Index Generator, has a restrictive 
definition. The left operand must be a vector. INDEXOF, 
shown belcw, is an extension of the operator Index Of that 


is consistent with Index Generator. 
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VaecA, INDEXOF 2 


C1) Z+((ppA),pB )pLORG+(pA)T,((,4)1B)-IORG 


V 2Z+A INDEXOF B;7 


Pi] Z2+(2,9A)T,((,4)1B)-IORG 


ea T+Q((+/Z013]),1+ppA)p1,pA 


[3] 203;2013)]/1x/pBI<7 


C4] Z+((ppA),pB)pIORG+1 04+2Z 
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The Index Cf operator used above is the restricted and not 


the extended definition. Two definitions are given for 


INDEXOF. The first illustrates the relevant points of the 


algorithm. The additional logic of the second is only for 
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the purpose of inserting (1+pA) for indices corresponding to 
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elements in B which are not in A. The result can be 
interpreted as follows. If R is the result of A IOTA B and 
A and E are rank two matrices then the value of B[BR;BP] can 
be found in A with indices R[;BR;BP]. For example the value 
tpiseine Boateb(east}.. Ris 2; 1\]) is the vector 2) 2.) fhererore 
A[2;2] has value 4. The result of this extended Index Of 
can ke used in the function Index above. The extensions of 


Index Cf and Index Generator are therefore consistent. 


The criticisms of this chapter point out some of the 
areas of inconsistency in the current APL. The Subscripting 
and Branching operators demonstrate weaknesses in the syntax 
while functions, Encode, Decode, and the mixed operators are 
examples of weaknesses in the semantics. For the most part 
the criticisms indicate the difficuities encountered in 
implementing a language that was not specifically designed 


to be implemented. 
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CHAPTER III 


Improvements To The Lanquage 


The improvements proposed in this Chapter are intended 
to provide partial answers to the adverse criticism of the 
previous Chapter. fhe inclusion of a new data type, the 
mixed-data type, and a new operator, Rename, would solve 
some of the difficulties associated with indexing and 
functions mentioned previously. The new data type has the 
additional benefit of allowing some algorithms to be easily 
implemented that cculd only be done in an awkward and 
inefficient manner previously. Scme of the current 
implementations have included an I/O facility through 
extended definition of operators and functions. An 
alternate approach that associates symbolic names with files 
is suggested as a more consistent extension to the language. 
The branch operator is replaced by the syntactic constructs 


of IF, THEN, ELSE and an iteration clause REPEAT, END. 


The proposed mixed-data type is a scalar or array in 
which each element is a scalar or array of any dimension and 
attributes except that of mixed-data type. From the point 


of view of the implementation a mixed-data type vector is 
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Similar to a parameter list in other languages. Thus the 
values of a mixed-data vector would be pointers to other 
arrays or scalars. A new dyadic operator for creating 
mixed-data type arrays is ";". The following examples 
illustrate its use: 

ADEA 2 Dil eselecr st 

DCE E NAME 11312519 
The second example will be referred to in the following 
section discussing Rename. It can be seen in the first 
example that the operator ";" is derived from the present 
syntax for indexing. It is also used in a rudimentary form 
to output values of dissimilar data types. In statement 
one, A becomes a three element vector whose scalar elements 
are the scalars or arrays I1, I2 and 13. For example, I1, 
I2 and 13 might be, respectively, a character array, a 
scalar real number, and an empty vector. Other such diverse 
combinations are possible. The scalar components of A can 
subsequently be accessed by indexing A. However the rank 
and dimensions of A{2] are not necessarily that of a scalar. 
In the example above A[2] would have the same attributes and 
dimensions as, and could be used interchangeably for, I2 
until a Specification operation changed the vaiues of A{2] 
SGCelc ett el? 1s a rank thLee acray Chen S(Al 24) 42. sn would 
select a scalar element from that array. The definition of 


the operator ";" parallels the definition of catenation. 
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The essential difference in effect between Catenate and ";" 
is the data type and dimensions of the result. The 
expression A;B may be interpreted as one cf three 
possibilities: 

1. Neither A nor B is mixed-data type. Then the result 
is a two element mixed-data vector in which the two 
scalar components are identical to A and B even 
though they may be matrices. 

2. Only one of A or B is mixed-data type. If A's type 
is not mixed-data then the result will be formed by 
making A a scalar component and "catenating" it to 
the vector B producing a mixed-data vector. If E is 
not a vector then a rank error occurs as the 
"Catenaticn" of a scalar and matrix is undefined. A 
Similar result occurs if B's type is not mixed-data. 

3. Both A and B are mixed-data type. The result is 
mixed-data type and the dimensions of the result and 
the restrictions on the operands are the same as if 
the operatcr had been Catenate. 

It is important in the definition of ";" that scalar 
components of a mixed-data array cannot have the attribute 
of mixed-data. A data type with this recursive definition 
could be supported but cannot be handled in a consistent 
fashion with just ";" because of point (3) above. The 


monadic use of the operator ";" would be similar to ravel. 
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If the operand were not mixed-data type then the result 
would be a one-element mixed-data vector. If the operand 
were mixed-data type then the result would be a mixed-data 
vector with the scalar elements taken from the operand in 


row-major crder. 


All the primitive operators of APL would be defined for 
mixed-data operands with the exception of reduction and 
inner and outer products. Scalar arithmetic operators would 
add, subtract, etc. the arrays which are defined by the 
scalar elements of the mixed-data operands. Mixed operators 
would manipulate just the scalar elements of the operands 


without altering the arrays to which they refer. 


In statement 2 above, B is defined similar to A with 
the proposed Rename operator. Rename does not reallocate 
the values as does Specification. Instead the operator 
allows a value or set of values to be known by two different 
names. Changes tc the values of I1 are reflected in B, thus 
values associated with B are the same values as those 
associated with I1, I2 and 13. B differs form A in that the 
values of A are equivalent but not the same as those of I1, 
I2 and 13. Mixed-data variables are very similar to 


parameter lists in other languages. The Rename operator 
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allows these “parameter lists" to be named and manipulated 
without unnecessarily having to duplicate the values. The 
facilities provided ky Rename do not exist in any form in 
the language. Thus Rename would have to be added as a new 


primitive operator. 
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Function invocation would be changed so that operands 
are passed by name rather than by value. This change will 
have several effects. Programmers will be akle to replace 
operators such as Specification and Index with functions of 
their own definiticn. Implementations would make better use 
of memory as they would not allocate and move arrays 
needlessly. Programs will not be recursive by default. 


Recursion can be forced as follows: 


Les V Z+«FAC N ore V Z+FAC N37 
C1] >0x1Z+N=0 eel. +0x1Z+N=0 
E23 N+«N-i (e2F] T+N-1 
fi) Z+NxFAC N Khe Z+NxXFAC T 
V V 


Both examples are simple functions to calculate the 
factorial function. However the first example will evaluate 
to zero for all positive values of N. The error occurs 


because N is being passed by name and its value is being 
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decremented to zero. Thus N x FAC N evaluates to zero. The 
second example uses the local variable T to explicitly force 


recursion. 
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The Operator Index is considered in its two modes, 
retrieving values from, and assigning values to arrays. The 
retrieval mode A[2;3], would have the paired brackets 
replaced Ly a single graphic in place of the left bracket. 
The result would remain the same only the syntax is changed. 
For assigning values the index operator is eliminated. Its 
function is taken over by Specification. The syntax is: 

MPECA Rit sal s Lia tree es LV) 

2- AHR*B 
In both statements the left operand of Specification is a 
valid data type represented by a valid expression. The 
Specification operator need no longer be context dependent 
to distinguish ketween the two statements as the left 
operands are different data types. In statement 1. the 
array being indexed is the first element of the left operand 
and the indices are all the succeeding elements. If any of 
the indices I1 through IN are empty vectors(10) then ARR 
will have values specified throughout the range of these 


indices. This is the equivalent of the expression 
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ARR(I1;3;;1]+B in the present implementation. These changes 


to subscripting have two advantages: 


1. Index is nc lenger context dependent. 


2. With the changes to function invocation, functions 


can be written which subscript arrays in a different 


manner than Index[7]. Without these changes the 


only method of incorporating a different 


subscripting facility is to add a new operator to 


the implementation. 


The disadvantages cf these changes are: 


1. The Index operator 


does not stand out as clearly as 


do bracketed expressions. Thus complex expressions 


kecome more difficult for people to analyze. 


2. The language becomes slightly more verbose as an 


empty vectcr must be explicitly specified to index 


through an entire dimension. 
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In current iaplementations there are two types of 


statements - specification 
of these causes any I/C to 
expression which cannot be 


specification or kranch is 


and branch statements. Neither 
be done inplicitly. An 
categorized as either 


arbitrarily categorized as a 


specification statement with as implicit Quad and 
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Specification to the left of the expression. Consider the 
following expressicns: 

tee ac Ras 

Cea Ao) 
statement 1. causes no I/O whereas statement 2. does. This 
context dependency could prove inconvenient in situations 
where specification or subscripting has teen replaced by a 
function. For example, if a function SPEC has been defined 
similar tc the Operator Specification, then a statement 
consisting of the expression A SPEC B would cause the values 
of A to be output. This output should not be forced on the 
programmer. Rather, context dependency of expressions 
should be eliminated and the programmer must explicitly 


indicate at what pcint he wishes I/O to be done. 


3.6 I/O Facility 


Ivyc facilities are being added to some APL 
implementations[9] by extending the definition of some 
operators such as Ibeam. These operators are then imbedded 
in locked functions to make their use easier. Although 
operators are part of the language the result of this ad hoc 
approach is that IyvO is being added to the implementation 
and not the language. Consequently there is little 


likelihcod of compatibility of programs between 
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implementations. 


If we analyze the present I/O facility of APL we find 
that Cuad has properties of both variables and niladic 
functions. Quad cannot be a niladic function because a 
function cannot have values specified to it as in the 
following expressicn: 

Q+A 
If we consider Quad to be a symbolic designation of a 
special variable - despite the inconsistencies - we can 
expand this primitive facility into a more general forn. 
This can be done by adding a mechanism to the language which 
allows any valid symkolic name to be declared as an I/0 
variable. There exists in the language a mechanism for 
declaring - defining - symbols to be functions. A similar 
mechanism would declare a symbol to be an I/O variable. A 
symbol could then represent one of three things, a variable, 
an I/O variable, or a function. From the point of view of 
the implementation an I/O variable would be quite distinct 
from a variable. However, in the language they should be as 


similar and interchangeable as possible. 


When defining I/O variabies one of two types, a simple 
type and a complex type, would be specified. This 
information is primarily for the implementation's I/0 


support routines but would have effects in the language. On 
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output through a complex type I/O variable both the data and 
its descriptor would be written to the I/0 device. Details 
concerned with blocking, the host operating system, the 
machine model, and the format the data is written in would 
be specific to the implementation. Thus data in this format 
is likely to be incompatible with other implementations and 
I/O devices. On input the data is self-describing and the 
I/O variable virtually indistinguishable from a variable. 
The complex type is intended for data bound for intermediate 
mass storage. Arrays of arbitrary rank and attributes can 
be output and retrieved without recourse to formatting 
schemes and intermediate data forms. As the descriptors are 
included in each I/O operation there is no necessity for 


conformakility between two successive I/O operations. 


The simple IyO variable is a more powerful and general 
facility but very inflexible in its use. On input or output 
the Iyv0O variable must be a boolean vector. On input a block 
of data received from an I/O device will be considered as a 
string of bits which will determine the values of a boolean 
vector. The length of the string will be determined by the 
number of bits received from the device. Any communication 
control characters in the string would not be stripped by 
the I/O support routines. Output must be a boolean vector 


which has any necessary control characters included in the 


Siuav ah wetetue +} WeGh O8a 

siare® 2502 vi sab sae 
bre SAC LIT aval, ad sotba wh seals 3 od oF 

ed? tin paiutipeob-tiseh ef sash adt tugnk a0 «BODE 

ei@elvav © ovr tMpde opel Sebbod thienasty nies 

oisiseadseint (ir vevod areb a8 Boiarraed 2! euy? «© 

aad parbdisess Ooo Ade, yreatides te aYei3h 4? 

\abtiadeot a2 eashooe snenely Bayskrls3 Bas 4 

Sif <35fgiltcsh Six 24 .2ua0P seep etetiadsernk ban & 

703.43} 48anb of er #4849 robs exago OT dose at 

~snO LF RIT4G0- H, a1 seneneus ouy mhawred PILL 


issenve Sra fobsoved o20a "= ab eldaltay OXT Sigate-odt 

Mais Zo tata” mp 200 8¢4 ab afdixysiict yaar gut j b 
inolu & SA *2OraeaV upetocd i oh toy oidndsev OD 
s 25 dient 2 30° ed\ Dit saaved BM a& 8632 feVisoed: 
ni Loop! ‘ ia Baussy fale yeeraara) itiw dsits atid to 
ase) Weal id itie wabide ite do dogual wd? ¢: 
Bo Etevtiteamo, bd hie pais Deviezes arid sisi 


28 


string. The primitive Iv0 facility is designed to get the 
data in and out of memory in a form characteristic of most 
digital computers. Formatting functions written in APL can 


provide the transfcrmation of data into a convenient form. 


The appearance of I/O variables in the language is 
Similar tc variables. For example: 

1. A+*FILEIN 

2 OF FLEOUT SPE 
Assume A and B are variables and FILEIN and FILEOUT are I/O 
variables. If FILFIN is simple then A will be specified as 
a boolean vector. Otherwise the descriptor of A will be 
determined by the descriptor accompanying the data which can 
be assumed valid. If FILEOUT is simple then B must be a 
boolean vector. If FILEOUT is not simple then the 
descriptor of B will be preserved. Each time an I/0 
variable appears as an operand in an expression being 
executed an I/O operation is performed to transfer the 


appropriate values. 


IyO variables differ from variables in that they can 
not be sukscripted when they are the left operand of 
Specification. The restriction arises in the following 
expression: 

CPILEs I1%02)+A 


If, in the process of declaring FILE, it was declared 
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“write-only" then an error will be indicated. The indices 
are not used to access an "indexed" file as this is not the 
interpretation that would be applied if the file were an 

input file. If FILE is "read-only" then a different error 


will be indicated. 


3.7 TF, THEN, ELSE and REPEAT 


The control structures IF, THEN, ELSE and REPEAT, END 
are to be included in the language and the Branch operator 
eliminated. There are two reasons for eliminating Branch. 
Primarily, the alternative control structures make many 
programs easier to understand and debug[ 16], and secondly, 
the syntax of the Branch operator is not consistent with the 
rest of the language. Eliminating Branch also eliminates 
the possibility of constructing iterative groups of 
statements. REPEAT, END replaces this facility with a 
Syntax that more explicitly defines the group. The syntax 
for these control structures is: 

1. IF (BOOLEANEXP) THEN (EXPS1) ELSE (EXPS2) END 
2- REPEAT (REPEXP) 

(EXP 1) 

(EXP2) 

(EXPN) 


END: LABEL 
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The symbols that are not parenthesized are constant and must 
appear as Shown for the syntax to be correct. Parentheses 
and parenthesized symbols are to he replaced by an 
expression or expressions. The results of example 1. are 
well known from ALGOL. BOOLEANEXP represents a single 
expression which evaluates to a scalar or single element 
array which is boolean. EXPS1 and EXPS2 represent zero or 
more expressions. A null expression results in no 
execution. iIn example 2., REPEXP is a single expression 
which evaluates to a scalar or single element array that is 
a positive integer. It specifies the number of times the 
group of expressions; EXP1, EXP2, EXPN, will be executed. 
END or END:LABEL delimits the group. LABEL is a string of 
characters that uniquely identifies, within a function, the 
END of a REPEAT group. The rules for forming strings which 
are valid labels are the rules for forming variable names. 
Two control structures associated with REPEAT are CYCLE and 
LEAVE or CYCLE:LABEL and LEAVE: LABIL. CYCLE causes 
execution of the next iteration to begin immediately with 
the first expressicn in the inner-most REPEAT group in which 
CYCLE appears. LEAVE causes execution to continue with the 
first expression after the inner-most REPEAT group. If a 
lahel is suffixed to the CYCLE or LEAVE then the REPEAT 
group that executicn is transferred to is the one identified 


by the label. This facility is useful when execution is to 
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be transferred out of several nested REPEAT groups{ 18}. 
Errors would be indicated if the label were not defined 
within the function or if the CYCLE cr LEAVE was not within 
the REFEAI group that the label was attached to. An example 
of a bukbble-sort program illustrates the use of this new 
syntax: 
V SORT*GRADEUP V;D3sI3d 

iF) SORT+ iI «pV 

esa REPEAT I<«I-1 

an FLAG*+Oxd+pV 

C4) REPEAT If 


53) IF V(SORT(J-1)J>VCSORTC I] 


C6] THEN D+SORT(J-1] 

L7d SORT(J-1)+S0RTCI] 
[8] SORTCJJ+D 

ch FLAG+1 

C10] ELSE END 


C11] J+J-1 
[12] END 
liom) eel timid 
C14] IF FLAG TREN CYCLE ELSE LEAVE END END 
V 
There are several implicit points in the program above which 
should be stated. One is that repeat groups are independent 


of subsequent values of variables that initialize then. 
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Thus when line 2 is executed a counter is set up which is 
not affected by changes in the value of I. The second point 
is the method by which expressions are delimited. When 
there are several expressions to be executed the only method 
of delimiting them is to list them on separate lines as 
Shown by Lines 6 through 8. The third point is the 
inconsistency of the reserved words added to the language. 
This objection could be overcome by replacing the reserved 
words with single character graphics. There would then be 
considerable similarity between the new control structures 
and parentheses. However, a considerable degree of 
intuitive appeal is lost by the replacement of well known 
symtolic names. As the reserved words have well known 
meahings it is suggested that they be retained. The fourth 
point is the need for preprocessing of programs. Consider 
the fcllowing statement: 

IF DIR=0 THEN I«1 ELSE I*ILAST END 
This statement cannot be executed right to left as other 
statements can. Thus a preprocessor must break it up into 
three separate parts so it can ke properly executed. In 
current implementations this would be awkward. However, if 
the implementation is done as suggested in the following 
Chapter this preprccessing could easily be incorporated in 


the pseudc-compiler stage. 
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Any program that will terminate in finite time can be 
represented by three kasic control structures[ 16}: 

1. Simple sequencing 

eee ee LuGNe rb ok 

3. REPEAT, ENC 
Programs written with these control structures are 
characterized by one entry at the start of the program and 
one exit at the end. The control structures ensure that 
execution takes a direct path from entry to exit. Thus all 
programs written will be very similar to the straight-line 
algorithms or methcd of leading decisions that Iverson 
advocated in his book "A Programming Language." In addition 
all non-recursive frograms must terminate in a finite time. 
All lcorfs must use REPEAT which has a finite replication 
factor. These factors combine to make prcegrams easier to 


understand and dekug. 


A. L. Anger{2]} has suggested the addition of an 
embedded dyadic branch operator. This would allow 
expressions of the following form: 

Z+Atd>LABEL 
In this expression if LABEL were not iota zero then the 


branch would be taken and the left operand of the branch 
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operator would be left unevaluated. If LABEL is iota zero 
then the result of the branch operator is simply the left 
operand J. Although this suggestion eliminates the 
syntactic weakness cf the branch operator it introduces the 


ability to construct APL statements that are very obscure. 


J. Ryan{17] has suggested the introduction of lists as 
a data type to the language. A ";" operator would be used 
to construct the lists. The following expression would 
generate a binary tree, T, with leaves A, B, C, and D: 

T+((A3B);(C3D)) 

The data structures that can be created using this operator 
are superior to the mixed-data type mentioned above. 
However the operator has kFeen made context dependent in 
order to achieve these structures. The removal of 
superfluous parentheses from the above expression changes 


the shape of the tree obtained. 


S$. Charmonman{11] has proposed a modification to the 
definition of confcormability for the dyadic scalar 
arithmetic operators. Charmonman proposed an extended 
definition that wculd allow any two operands to be 
conformable. In the case that the ranks of the operands are 
not the same then the operand with the smaller rank is 
reshaped to the rank and shape of the operand with the 


larger rank. If the ranks are identical but the operands 
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have different dimensicns then the operand with the least 
number of elements is reshaped to the rank and shape of the 
operand with the greatest number of elements. If the 
dimensions are different but the number of elements in the 
operands is the same, then the left operand is reshaped to 
match the right operand. However, as L. Breed[{10] has 
pointed out, this generalization to the definition of 
conformability destroys the properties of associativity and 


communtativity. 


The APL PLUS file subsystem was designed by L. Breed[9] 
and E. B. Iverson to add I/0 capability to APL. Their 
design objectives were: 

1. Operation with APL/360 with no changes to the 
language. 
2. Efficient cperation with arrays as objects of data 
transfer. 
3. File access does not envolve extralingual system 
commands. 
4. File sharing and high reliability. 
To achieve these objectives a single primitive operator was 
introduced. Then, for user's convenience, functions FTIE, 
FCREATE, FREAD, FAPPEND, FREPLACE, FRDCI and FKOLD were 
written all using the new primitive operator. fFTIE and 


FCREATE associate a file number with the file's name. 
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in addition creates a new file. FREAD retrieves an 
component cf a file. FAPPEND adds a component at 
of a file and FREPLACE replaces a component. fFRDCI 
component information about a file such as the 


number the component was written under and the time 


at which it was written. The FHOLD function allows 


synchronization between two people updating a file 


Simultaneously. 


Compared to other languages it can be seen that there 


are relatively few and simple concepts to be mastered in 


order to be able te obtain a sophisticated I/0 capability. 


In addition it is relatively simple to add this capability 


to current implementations. However, by adding the I/0 


capability to the implementation rather than the language, 


consistency between inplementations may well be lost. 
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CHAPTER IV 


improvements To The Implementation 


Current implementations of APL are simple interpreters. 
Character strings used to define programs have simple 
editing operations, such as removal of excess blanks and 
some standardizing of format, performed on them before they 
are stored. However, a high degree of visual fidelity is 
Maintained. The interpreter uses the edited strings for 
execution of programs. It begins its analysis of the 
strings on the left to check for possible comment statements 
or system commands and then continues with a right-to-left 
parse and executicn of the string. The visual fidelity 
retained in the strings{8] used to execute programs results 
in extra processing in the identification and resolution of 
operands, the identification of operators and the detection 
of simple syntactic errors. Preprocessing of text can make 
operator identificaticn and operand resolution significantly 
more efficient in terms of processing required. 

Considerakle efficiency in memory space and execution speed 
can be gained by deferral of operators and simplification of 
expressions as proposed by Abrams{1]. Deferral of operators 
results in reduced memcry requirements for temporary results 


and less computation for memory management associated with 
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temporary allocaticns. The following expression illustrates 
an ideal situation for deferral of operators: 

AcBitB2+ ... +B9+B10 
Current implementations allocate storage for a temporary 
result of B9 + B10 and then proceed with the addition. 
Allocation of a temporary result is not needed in this 
example. By deferring the add operation until the 
Specification operator is interpreted all the addresses of 
operands of add can te resolved. If the interpreter then 
adds only one scalar component from each operand and stores 
the result in A, temporary storage for only one scalar is 
needed. The process is repeated for the next scalar 
component of the operands. The scalars are accessed in row- 
major crder. After (x/pB10) repetitions A will be 
completely specified. As only one element of temporary 
storage is needed, a register can be allocated for this 
purpose with significant savings in frocessing time. 
Savings in execution time derived from the simplification of 
expressions is illustrated by the fcllcwing expression: 

A+(1 1 1)QB+C 
The result A, is just the diagonal sum of the arrays B and 
C. Current implementations in producing the temporary sum 
of B + C do much unnecessary processing in summing the off- 
diagonal components. Abrams proposed a Simplification 


scheme, called beating, so that the expression above would 
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be calculated as if it were: 

A+((1 1 1)QB)+(1 1 :1)8C 
Beating is the process of determining equivalent expressions 
which take fewer oferations to calculate than the original 
expressicns. Abrams showed that a subset of APL operators 
produced expressicns that are beatable. The above example 
would require processing time that is expcnentially related 
to the number of scalar components of the arrays for current 
implementations whereas Abrams' proposal takes processing 


time that is linearly related. 


Abrams stated his proposal as a design for a machine - 
the APL machine or APLM. Although it is not necessary to 
consider it in machine-like terms, Akrams* proposal will be 
referred in terms ascrikted to machines in the following 
Chapter. Many of these terms have equivalents in a software 
implementation. Fcr instance, registers rather than arrays, 


and bits rather than flags will be descrited. 


A software implementation of Abrams! APLM has been 
undertaken by the author and D. A. James. James worked on 
the DM and the author on a subset of the EM. The EM was 
coded in IBM/360 Assembler Language. The implemented subset 
demonstrated that the fundamental design of the EM is 
correct by properly executing expressions that confined 


their operations tc those of the subset. In addition minor 
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discrepancies in the logic as proposed by Abrams were found. 


4.1 The APLIM 


The APLM consists of three submachines, the C, D, and E 
machines. The purposes of these machines will first be 
described briefly, and in detail later. The C machine (CM) 
"Compiles" raw APL text into an efficient form for execution 
by the D machine (DM). The DM analyzes this code into 
beatable and deferratle expressions. When an operator is 
encountered that cannot be keaten or deferred the DM 
produces code for the E machine (EM). The EM is given 
control to execute the generated code and produce the 
result. It is reasonable, although redundant, to have three 
versions of a program as the different versions are expected 
to reside in different storage hierarchies. EM code may be 
imbedded in several nested loops and therefore resides in 
high-speed register storage. DM code is needed once for 
every invocation of the program in which it belongs and 
therefore resides in intermediate core storage. CM code is 
only needed once to generate DM code and possibly for the 
generation of error messages and can therefore be placed on 


a mass storage device. 


The CM does nct truly compile AFL text as it does not 


resolve the addresses of operands and only modifies 
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operators to make their identification a rapid process. It 
does compile in the sense that it reduces text to an opcode, 
operand form similar to one and two-address machines. The 
DM handles some operators such as monadic RHO by itself, but 
it is essentially a run-time “algebraic simplifier". These 
Simplification precesses cannot be done by the CM as they 
reguire the values of variables that are not determined 
until run-time. The CM is included as part of the machine, 
rather than simply being part of the software, so that the 
machine can be used recursively to evaluate an operator such 


as Unquote. 


4.1.1 Register 


in the APLM 


The APLM consists of stack and scalar registers, core 
memory, and the processing units for the C, D, and E 
machines. The C, [, and E machines all have access to the 
registers and core memory. The registers are as follows: 

12 dteration Stack (IS) 
CTR MAX DIR CH MRK 
0 2 0 0 1 
0 g 0 0 0 


3 3 1 1 0 <= Top Entry 


The IS is effectively a set of nested loops. The ISCTR 


is the CTR field of the entry at the top of the stack. It 
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should support values as large as the largest integer to he 
supported by the implementation. It should be able to take 
on values at least as large as the number of bits in core in 
order to effectively index large boolean arrays. ISCTR is 
incremented or decremented through the range of values 
associated with the loop. ISMAX specifies the initial or 
final value to be taken by ISCTR. ISDIR is a boolean value. 
If ISDIR is zero then ISCTR is initialized to zero and 
incremented to ISMAX, otherwise ISCTR is initialized to 
ISMAX and decremented to zero. If the above set of loops 
were being used to access the scalar components of a rank 3 
array then the dimensions of the array as derived from the 
ISMAX fields are 6, 10, 4. The ISMAX fields are one less 
than the dimensions as ISCTR is used to calculate a 
displacement from the origin of the array. ISCH and ISMRK 
are boolean values. ISCH is used to reduce the number of 
calculations required to determine the value of the 
polynemiai access function. The details of its function are 
explained in the section on the EM. ISMRK is used to 
delimit groupings within the IS. These groupings may 
correspond to the dimensions of arrays, reduction operators, 
or REPEAT groups. Its function will be more clearly 


explained under array accessing in the EM section. 
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2. Location Counter Stack (LS) 
RELSSORG GEENTLCDE SIS FN NWT QP 
10 Z0 055.0 00 0 0 0 0 


0 0 10 01 1 0 1 0 <= Top Entry 


The 1S functions as a location counter that makes DM 
and EM code readily relocatable. A segment of DM or EM code 
is invcked by pushing a descriptive entry to the LS. LSCDE 
specifies which of the three machines to be given control. 

A logic unit, called Maincycle, is given control after every 
DM or EM instruction executed. Maincycle uses LSCDE to 
select the appropriate submachine to receive control. 
Depending on which machine has control the application of 
LSREL, LSOFG and LSLEN is modified. In the DM LSORG 
specifies the starting location of a segment of code in 
memory, LSLEN delimits the extent of the segment, and LSREL 
selects the instruction, relative to LSORG, to be executed. 
In the EM the application of these fields are similar but 
apply to the Instruction Stack (QS) rather than to core 
storage. LSORG should be capable of specifying any core 
address. LSREL and LSLEN need not contain values as large 
as LSORG but smaller values artificially constrain the 
maximum size of programs. LSIS, LSFN, and LSNWT are boolean 
values that, along with LSQP, preserve information 


associated with the pregram segment. LSIS has a value of 
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one if the program segment is being repeated under the 
control of the IS and zero if not. When LSREL equals LSLEN 
the LS entry is popped unless LSIS is one. If LSIS is one, 
LSREL is set to zero, the IS is incremented - stepped - and 
the segment repeated unless the IS has ccmpleted - 
overflowed - its range of iterations. If the IS overflows 
then the top LS entry is popped regardless of LSIS. LSFN is 
one if the program segment the LS entry defines is an APL 
function and zero ctherwise. LSNWT saves the value of the 
scalar register, NEWIT, when another LS entry is pushed to 
the stack. LSQP is a pointer into the QS. It is used in 
the evaluation of sukscripted expressions. Its function is 
further explained in the EM section under subscripting. 


3. Instruction Stack (QS) 


CE ADDR 
S 5 
JMP 10 <= Top Entry 


The DM places the instructions it creates for the EM in 
the QS. The QS serves as a buffer for the EM instructions 
and as a stack for data associated with the EM's operation. 
Although some of the operations performed on the QS are 
stack operations many are not. Thus its designation as the 
Instruction Stack is a misnomer. The instructions shown are 


illustrative of EM instructions. These instructions are 
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discussed further in the section on the EM. 
4. Value Stack (VS) 
IAG VALUE 
ET 6 


ST S <= Top Entry 


The VS serves a variety of purposes. The DM uses it to 
temporarily store fregram segment descriptors, addresses, 
Mames, etc. The EM uses it to store temporary results 
during the evaluation of expressions. This is the main 
stack in the machine and it performs a wide variety of 
functions associated with many parts of the machine. Its 
basic purpose is the storage of temporary results. VSTAG 
specifies an interpretation to be associated with VSVALUE. 


The tag ST identifies 5 as a scalar value. 


1. Memory 

Memory contains arrays, array descriftors, a free pool 
area, and freed spaces. Arrays are kept in the low end of 
core with freed arrays in a doubly linked list. Array 
descriptors are kept in high core and freed descriptors 
doubly linked. Between the arrays and array descriptors is 


a free pool area of allocatable storage. 
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2. Name Index Table (NIT) 

The NIT is a N,3 matrix. Each row of the matrix 
represents a symbolic name in the original text. A 
Character vector, that is the concatenation of all the 
unigue symbolic names encountered in the text, is maintained 
in asscciation with the NIT. The first column of a row is 
the index in the character vector of the first character in 
the name. The seccnd column is the length of the name and 
the third column an INX value to be associated with the 
name. INX is the value that will be used in place of the 
symbolic name in the DM code. The rows of the NIT are 
ordered according to the collating sequence of the names 
they represent. When a new symbolic name is encountered in 
the text it is concatenated to the character vector and 
assigned the next INX value. The INX value is recorded in 


the row inserted in the NIT. 


3. Name Table (NT) 

The NI is an associatively addressed memory. Each 
entry has a name, tag and value field. The name field 
contains an INX value corresponding to a symbolic name in 
the original text. The tag field is similar to the tag 
field discussed in connection with the VS. Tags designate 
various properties of INX values such as scalar, function, 


undefined array, etc. The value field is interpreted 
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differently for each tag. Some value fields contain MeMOry 

addresses. Others, such as those tagged as scalar, have the 
value of the scalar in the value field. Abrams included the 
NT with the registers but its relatively infrequent use 


suggests it should reside in a lower level of storage. 


4.2 Phe C Machine (CM) 


The CM accepts APL text and produces DM code in reverse 
Polish form. DM code has operators coded as fixed length 
opcodes which will index the appropriate operator routines 
when applied to a table of addresses. Operands are also 
transformed into fixed length codes using the NIT. The 
resulting DM code has a format similar to the machine code 


of zero and one-address machines. 


4.3 The 
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The DM analyzes expressions in reverse Polish code 
produced by the CM to produce EM code which will be more 
efficient than simply executing operators in order of 
precedence. A subset of the DM instructions with a brief 
annotation to explain each can be found below. These 
instructions are represented by an assembly language format 
rather than numeric values the CM generates. All examples 


of DM code will use this assembly language. 
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[TM instruction Set 


Load Scalar: IDS scalar 

The scalar is an immediate operand. This instruction 
is tised in the expression X+I+1 to indirectly cause the EM 
to load the constant 1 to the VS. The DM does this by 
generating EM instructions and then passing control to the 


EM for execution. 


Load Constant Array: LDCON disp 

The operand disp represents a displacement from the 
beginning of the program that the instruction is in to the 
location of a descriptor array for the constant. The 


constants are pooled at the end of the program. In the 
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expression (2 3 5)+tA the LDCON instruction would be used to 


define the left operand of Take. In the expression 
(2 3 5)+A the LDCCN instruction would cause the EM to load 


the values of the constant array. 


Load Name and Fetch Value: LDNF name 

The operand name is the INX value assigned by the CM. 
The symbolic name can be recovered from the NIT. By 
searching the NT fcr the INX value it can be determined if 


the name is defined. If the name is defined the NT entry 
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has the information necessary to access the value. In the 
expression K«I+1 the LDNF I instruction would cause the EM 


to load the value cf I. 


Load Name: LDN name 

IDN is similar to LDNF above. In the expression X+I+1 
the LDN K instruction would cause the FM to load the memory 
address of the values cf K. This address can then be used 


by the Specificaticn operator to do the assignment. 


load J-vector: LIJ len,org,dir 
This instruction is defined similar to the function 
JFUN defined as: 
V Z<«JFUN ARG;LEN;ORG;DIR 
19) LEN+ARG(1] 
2 ORG+ARG([2] 
paz! DIR+ARG(3] 
C4) IF DIR=0 THEN 2+ORG+(\LEN)-ICRG 
C5] ELSE Z+«(LEN+ORG-1)-((1LEN)-IORG) END 
Vv 
The result of this function is a vector knowWn as a J-vector. 
J-vectors are an extension of the operator Index Generator. 
In the expression (1 2 3 4)tA the LDJ 4,1,0 instruction 
would define the left operand of TAKE. Nctice that the CM 


will have to detect instances of J-vectors. The efficiency 
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gained by J-vectors is sufficient to warrant the inclusion 


of the LDJ instruction. 


Assign: ASGN no operand 
In the expression X+I+1 the instruction ASGN would 
cause the EM to store values in the VS at K. After the 


values are stored they are popped from the VS. 


Assign and Leave Value: ASGNV no operand 

In the expression X*«I+I+1 the instruction ASGNV would 
be used for the assignment of values to I. ASGNV is similar 
to ASGN except that values are not popped from the VS. ASGN 


would then store the values in K. 


Add: ADD no oferand 

This instruction generates EM code that will add the 
top two entries of the VS and replace the entries by the 
result. Add is representative of many other dyadic scalar 
arithmetic operators. These should re well known from the 
language and will not be repeated here. They are, however, 


part of the DM instruction set. 


Flus: PLUS no operand 


This instruction generates EM code that will perform 


the monadic plus operation on top of the VS and replace the 
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entry by the result. Plus is representative of the cther 


monadic scalar arithtetic operators. 


Take: TAKE no operand 

The right and left operands of TAKE have been pushed to 
the VS as unevaluated EM program segment descriptors. The 
right operand is inspected to determine if it represents a 
deferred keatablie program segment. If the segment is not 
beatable it is evaluated to temporary space. The left 
operand is then used to beat the descriptor of the right 
operand. The program descriptor for the left cperand is 
popped from the tog of the VS. Take is a selection operator 
and does not generéete any EM code. Take is representative 
of the dyadic selection operators Drop (DROP) and Transpose 


(TRANS) . 


Reverse: REV no oferand 

The right operand and the co-ordinate the operation is 
to ke applied to are defined ty segment descriptors on the 
top of the VS. If the right operand is not beatable it is 
evaluated to temporary space. The descriftor for the right 
operand is then beaten. Like Take, Reverse is a selection 
operator and generates no EM code. Abrams defined Reverse 
with an operand K such that the expression O(KJA would 


generate [DM code REV K. This is not sufficiently general. 
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The expression $[+/K]A can not be handled as expressions can 
not be operands of DM instructions. Cther instructions that 
differ in a similar manner from Abrams' are Laminate, 


Compress, Expand, Rotate, and Reduction. 


Decode: BASE no operand 

The left and right operands are evaluated to temporary 
space if they are unevaluated expressions and not variables. 
The Decode routine is invoked and the result evaluated to 
temporary space. fLfecode iS representative of a group of 
operators that are evaluated immediately. These are Encode, 
Decode, Grade Up, Grade Down, Catenate, Ravel, Dimensicn, 


and Restructure. 


Index Generator: UIOTA no operand 

UIOTA is similar to BASE in that it is evaluated 
immediately. It is evaluated to a J-vector rather than 
temporary space. After UIOTA has been “evaluated" the J- 


vector can be deferred and beaten. 


Index: SUBS K 
The operand on the top of the VS must be a rank K 
array. The next K entries in the VS represent the 


subscripts. The DM generates EM code to fetch the operands. 
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It should be remembered that DM instructions need 
interpreting. For example, the instructicns LDNF I and LD 
J appear Similar to the IBM 360 instruction L 9,NAME. 
However the latter instruction will have the memory addres 
of NAME resolved when it is executed. The former 
instructions need interpretation to resolve their memory 
locations. The operand must te searched for in the NT and 


found kefore a value can be accessed. 


These instructions can be used to represent an APL 


program as shown in the following examrples: 


DM Code 
K+J+I+1 K+(J+I)+2 
Address Operation Operand Address Operation Operand 

500 LDS 1 600 LDS 2 
501 LDNF I 601 LDNF af 
502 ALD 602 LDN J 
503 LDN J 603 ASGNV 
504 ASGNYV 604 ADD 
shes LDN K 605 LDN K 
506 ASGN 606 ASGN 

Example 1 Example 2 


The DM code above is just recoded representations of the 
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original expressions. Notice that the CM must Le sensitive 
to context in assegbling these instructions. In assigning 
values to J an ASGNV instruction is used. ASGNV creates EM 
code which will leave values in the VS for further use - in 
this case for assignment to K. In assigning values to K 
there is no further need for these values and the ASGN 
instruction will cause them to ke popped. The CM must also 
be able to detect the double occurrence of J in the 
following expressicn: 

K+ (67) +J+I 
If the CM uses ASCNV for J«ZI rather than ASGN then erroneous 
values of ($¢J) will be used. The need for the CM to be 
examine the context is due to the design cf DM code and not 
inherent in the source language of APL. Making DM code 
contextually dependant results in more efficient execution 


of expressions. 


Most DM instructions facilitate either the fetching and 
storing of variables or operations corresponding to the 
primitive operators in the language. Instructions which 
store or fetch variables are used to construct EM 
instructions which will store and fetch values to and frecm 
memory. The DM builds the EM instructions in the QS and 
pushes a segment descriptor - a location and length 


description - to the VS. When the DM enccunters an 
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instruction corresponding to a primitive operator it finds 
the operands at the top of the VS and checks them for 
conformability. If the operands are ccnfcrmable for the 
operator then the cperator is classed as one of immediately 


evaluated, deferrakle or beatable. 


An operator is immediately evaluated if it can not ke 
deferred or beaten. The operators that are evaluated 
immediately are Decode, Encode, Grade Up, Grade Down, 
Catenate, Ravel, Dimension, and Restructure. Some 
Operators, such as Base, are immediate because they need 
access to all the values of their operands before any part 
of the result can be specified. Catenate and Ravel are 
immediate because of restrictions in the method by which 
array accessing is done. The accessing method is logically 
Similar to a set cf nested loops. Catenate needs two 
consecutive sets of loops and Ravel needs twe parallel 
loops. Without considerable modification to the APLM these 


facilities can not be provided. 


An expression is deferred by creating the EM code in 
the QS for later execution. Any expression can be deferred 
but the benefits cf deferral may be negated by the cost of 
deferring them. Thus expressions are deferred until: 

1. The expression is exhausted and must be executed. 


2. A function is encountered. Niladic functions are 
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treated as variables by the CM. The DM must detect 
niladic functions and cause execution of the 
expression up to the point where the function was 
encountered. The niladic function can then be 
evaluated to temporary space so that the result can 
be treated as a variable. The function cannot be 
evaluated sooner as illustrated by the following 
expression: 

A+NF+B+L/V 
If NF represents a niladic function which alters the 
variables E or V then evaluation cf NF before B is 
specified will likely cause an error. Monadic and 
dyadic functions must have their operands evaluated 
before the functions are invoked for similar 
reasons. 
Specification or one of the operators that is 
immediately evaluated is encountered. Specification 
is not deferred if it has been coded as ASGNV rather 
than ASGN. This is context dependent. 
A General Dyadic Form (GDF) operator is encountered. 
GDFs are a result of inner and outer product 
operators occurring in the source. In this case the 
right operand is evaluated to temporary space. if 
GLF's were deferred then each element of the right 


operand would be calculated as many times as there 
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are elements in the left operand. This needless 
repetition obviously should te avoided. 

5. The deferred code exceeds some implementation 
limits. The size of the QS is likely to be the most 
restricting limit. Multi-dimensicnal arrays, even 
though they may contain only a few elements, require 
considerable space in the QS relative to scalar 
operands. 

6. The result of an expression is needed for the 
process of beating. For example: 

BeS+Ae(71+72+73 ) +x 
Selection operators such as Take are evaluated by 
the DM by the process of beating. The left operand 
of Take can not be used for beating until it is 
evaluated. The expression (71+72+73) is not 
deferred. The expression MxN is however deferred 
until the DM encounters the specification of B. If 
the right cperand of a selection operator is not 
beatable then it is not deferred. The method for 


determining beatable expressicns is explained below. 


The operators that can be beaten are the dyadic 
selection operators Take, Drop, and Transpose and the 
monadic selection operator Reverse. Index can be beaten if 


the indices of the array are scalars or J-vectors. We would 
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expect arrays indexed by scalars and J-vectors to be 
beatable as the same result can be obtained by replacing 
Index by a combination of Take, Drop, Transpose, and 
Reverse. For example, if A is a 10 by 10 by 10 array, the 
following two expressions are equivalent: 

Alisa 6 we] (1) 


(0,5,4)+(3,6,8)4A (2) 


The DM keats operators Ey changing the pclyncmial access 
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function of the right operand. For example if A is defined 


as above: 


EXpression Polynomial Access Function 
A AVi(P Ry Chew =) 7100R 4s 10R 4G 
ees 5))tA AV(P,R,C) = 100P + 108R + 8C + 805 
Cas 5) +A AV(P pRB Clee =, ODE ety 108s +)>CG + 35 
@C1]A AV(P,R,C) = -100F + 10R + C + 90 
oC 2]4 AV(P,R,C) = 100F ~ 10R + C + 90 
City it 1s) OA AV (C) = 100C + 10C +c 
= Guat AS 
(2,1,2)&A AV (R,C) = 100C + 10R +c 
Cl. Fai Jon AV (R,C) = 100R + 10C + R 
P - Plane R - Row Cc. = Column 


P, R, C are zero origin indices 


AV <= ,A 


The EM does not calculate the polynemial access function in 
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an obvious fashion. The mechanism used is computationally 
more efficient and provides better error checking than the 
evaluation of a polynomial function. As a consequence the 
beating process by the DM is more involved. All the data 
involved in beating is contained in the descriptors of 
arrays. For every variable there is a Name Table (NT) entry 
which contains the pointer to a descriptor. A descriptor or 
descriptor array (0A) contains the fcllowing information 
about arrays: 

DATYPE - the implementation type, integer, boolean, 


real, etc., of the values described. 


DALEN - the amount of memory allocated to this 
descriptor. 
DAFILL - the amount of memory unused by the 


descriptor. 
DAREFCNT - the number of references to this descriptor. 


Multiple references may stem from the NT or 


EM code. 

DAVEASE - the memory location where the values are 
stored. 

DAABASE - the displacement from VBASE to the first 


value. ABASE is generaily zero unless 
beating alters it. Its value is the constant 
shown in the polynomial access functions 


above. 
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DARANK - the rank of the array. 

DADIMEN - the dimensions of the array. There are 
DARANK of these. 

DALEL - the coefficients of the pclynomial access 
function. There are CARANK of these. The 
location of an element of an array is given 


by VBASE+(DADELxXINDICES )+ABASE 


In beating an Operator the descriptor of the right operand 


is altered. Beating constitutes the execution of the 


operator by the DM. It generates no code in the QS for the 


EM. 


The algorithms for changing the descriptors are: 

1. Q+tA 
DAABASE*DAABASE+DADEL+.*(Q<0)xDADIMEN-|9 
DADIMEN+|Q 

2. QYA 
DAABASE+DAABASE+DADEL+.x(Q>0)x|Q 
DADIMEN+DADIMEN-\Q 

3. {JIA 
DAABASE*DAABASE+DADEL( J ]xDADIMEN( J] -1 
DEL(J]+-DEL(J] 

4. Qa 
DARANK+1+1/A 
R«+DADIMEN 
D+DADEL 


DADEL*DARANKtDADEL 
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DADIMEN*+DARANK+4DADIMEN 
I+0 
REPEAT (DARANK) 
DADIMEN(IJ+L/(I=Q)/R 
DADEL( I J]++/(I=Q)/D 
dima hy ail 
END 
5. Subscripting by a scalar 
Let K denote that the scalar is the Kth subscript 
(zero origin) 
DAABASE+DAABASE+DADEL(CK]xSCALAR 
DADEL+(K#\DARANK )/DADEL 
DADIMEN+(K#\DARANK ) /DADIMEN 
DARANK+DARANK-1 
6. Subscriptirg by a J-vector 
Let K denote that the J-vector is the Kth 
subscript (zero origin) 
Let the argument for JFUN, mentioned above, be 
TEN; ORG, Ss 
DAABASE*+DAABASE+DADEL(K ]xORG+LEN-1 
DADIMENCKIJ+LEN 


IF S=1 THEN DEL(KI]+-DEL[K] 


The mechanism the DM uses to simplify expressions like 


is shown above. How this mechanism is extended to 
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handle expressions in which A is replaced by a complex 


expression is shown below: 


APL Expression: A«(3+A1+A2)-(7+4B1xB2) 


[CM Code EM Code 
Address Operation Operand QSADDR Cperation Operand 

250 LDNF B2 0 IFA @B2 
25a LDNF B1 1 IFA aB1 
Zoe MULT 2 OP MULT 
Paya) LDS 7 3 IFA @A2 
254 DROP 4 IFA @A1 
Zoo LDNF A2 5) OP ADD 
256 LDNF A1 6 MINUS 
P34) ACD 7 OP ASGN 
258 LDS 3 8 POP 
209 TAKE 
260 MINUS 
261 LDN Z 
262 ASGN 


When the DM encounters the DROP instruction it can determine 
the operands of the instruction by inspecting the VS. The 
top entry of the WS indicated a segment of code, a Load 
Scalar 7 instructicn, in the EM. This segment evaluates to 
the left operand of DROP - which was overwritten 


subsequently by the IFA @A2 instruction. The entry second 
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from tcp of the VS indicated a segment of code, QS addresses 
0 through 2, which is the right operand of DROP. In order 
to keat DEOP the DM scanned for IFA instructions in the code 
for the right operand in the QS. It changed the descriptor 
corresponding to the oferand of the IFA instruction using 
the values of the left operand as described in the 
algorithms above. The operand of an IFA instruction, @B2 
for instance, is the address of the descriptor for @B2. 
Having keaten all descriptors it found by means of IFA 
instructions the DM pops the top entry off the VS and 


continues processing DM code. 


The beating frocess described akove can be applied only 
if the right operand of the selection operator is beatable. 
If the operand is not beatable it is evaluated to temporary 
Space and then teaten. The operands that are beatable are: 

1. Variables cr expressions that the DM has reduced to 
a J-vector. 

2. Expressions formed from a dyadic or monadic scalar 
arithmetic operator with beatable operands. 

3. Reduction with a beatable expression as an operand. 

4. The General Dyadic Form. Both operands are always 
evaluated to temporary space ane are therefore 


beatable. 


5. Expressions cf the form NpSi1 where S1 is a scalar or 
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one element array. Restructure is normally 
evaluated immediately but this special case can be 
deferred in the QS as: 

5 $1 

IRD @TEMP,MASK=X 
The beating process is applied to the descriptor 
@TEMP. 
Subscripted expressions when the the subscriptee and 
the indices satisfy particular conditions. The 
method of kteating selection operators when the right 
operand is a subscripted expression is shown by the 
equivalence of expressions below. The expressions 
are listed in fairs; each pair is equivalent. 
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It is assumed here that the definition of Take and 
Drop is such that v/(|Q)>pA is always zero. If the 
extended definition of Take and Drop is used then 
this condition would be tested for before the 
equivalent expression could te beaten. The indices 


for Take and Drop must ke beatable expressions. The 
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Qth indicy must be beatable for feverse. MTranspose 
requires A to ke beatable as well as Q being a 
permutaticn of (1pQ). The subscript J1 is identical 
tc In where n is (Q11). In general Jm is identical 
tc In where nis (Qim). For example, the follcewing 
three expressions are equivalent: 

Lee CA Irs 2s F301 


(3 
((3 1 2)8A)CJ1372373) 
((3 1 2)8A)CI2;73;71) 


4.4 The EF Machine (ER) 


The organization and operation of the EM more closely 
resembles conventicnal computers than any other component of 
the AFLM. The EM moves scalar values between registers and 
memory, performs various primitive oferations, such as add, 
on the values in the registers, and contricls the mechanism 
for the indexing cf arrays. A subset of EM instructions 


with a brief annotaticn to explain each follows: 


EFM Instruction Set 


Load Scalar: S scalar 


The scalar is an immediate operand of the instruction. 
The scalar operand is pushed to the VS with the tag ST 


denoting it is a scalar value. 
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Initialize Fetch Array: IFA descriptcr, MASK=00011 

This instruction creates an FA instruction and replaces 
itself with the new instruction. FA instructions are not 
generated directly by the DM as IFA instructions are much 
more easily beaten. The DM instruction LDNF causes the 
creation of IFA instructions. The descriptor from the NT 
associated with the operand of the LDNF instruction is 
inserted into the IFA instruction, and the descriptors 
reference count incremented. Subsequent beating of the IFA 
instruction will cause a copy of the descriptor to be made 
and the operand of IFA changed to point to the new 
descriptor. To execute the IFA instruction the EM accesses 
the information in the descriptor as well as the IS to build 
an iteration control block, (ICB) at the top of the QS. The 
entries in the ICE are explained below. MUhe EM pushes an 
entry to the ICB fer every one value it finds in the 
boolean, vector MASK. MASK determines by bit position which 
entries of the IS will be used in building the ICB. fhe 
Cight-most value in MASK corresponds to the bottom entry in 


the stack. 


Fetch Array: FA LINK=10, VBASE=700, SUM=164 
This instruction is used to fetch elements from arrays 
and push them to the VS with the tag ST. LINK is the 


displacement from the FA instruction to the ICE. VBASE is 
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added to SUM to generate the memory address from which a 


value will be fetched. 


Initialize Array Address: IA descriptor,MASK=00011 
IA instructions are very similar to IFA instructions. 
They differ in that the DM instruction LDN generates then 


and that they in turn generate A instructions. 


Fetch Array Address: A LINK=20,VBASE=950, SUM=164 
This instruction generates a memory address and pushes 
it to the VS with the tag AT. Its operands are the same as 


the FA instruction. 


Initialize J-Vector: IJ len,org,dir,MASK=00010 

This instruction creates a J instruction and replaces 
itself with the new instruction. MASK is used to link the J 
instruction with an entry in the IS. The IS entry will have 
an ISMAX field equivalent to the operand len. The org and 
dir operands of IJ become the incr and dir operands of J. 


The curr operand of J is initialized to zero. 


Fetch J-Vector: J curr,incr,dir,INX=2 
The J instruction generates the scalar component of the 
J-vector it defines and pushes this value to the VS with tag 


ST. To generate the value the instruction tests the ISCH 
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field of the IS entry indexed by INX. If ISCH is one then 
ISCTR replaces curr and curr plus incr is pushed to the VS. 
If ISCH is zero then curr plus incr is pushed to the VS 
which is the same value as was previously pushed to the VS. 
The incrementing cf J-vectors is linked to the VS to 


facilitate the generation of indices for subscripted arrays. 


Cperator: OP sao 

The operand is one of the monadic or dyadic scalar 
arithmetic operators. The operation is applied to the top 
one or two values cn the VS and the result left in place of 


the operand or operands. 


Result Dimension: IRD descriptor, MASK=00011 

This EM instruction is really a DM tag denoting that 
the dimension of a resuit can be found by inspecting the 
operands. Segment descriptors in the VS define for the DM 
deferred segments cf code. The dimensions to be associated 
with these segments can be found by scanning backwards 
through them searching for IRD, IA or IFA instructions. IRD 
instructions are inserted when the IA cr IFA instructions 
are not indicative of the resulting dimension. For 
instance, inner and cuter products, reduction, compression 
and expansion are examples of operations in which the 


operands do not have the same dimension as the result. The 
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EM changes an IRD to a NIL instruction. 


Load Segment Descriptor: SGV org,len,mode 

The operands org, len, and mode describe an EM segment 
of code. Org is relative to the SGV instruction. It 
represents a displacement that must be subtracted from the 
current instruction address. Org is made into an absolute 
QS address and the segment descriptor pushed to the VS with 


the’ tag SZ. 


Jump: JMP link 
The LSREL field is incremented by the amount link. 


Link is a signed integer. 


Jump Zero: JO link 
The LSREL field is incremented Ey the amount link if 
the value on the tcp of the VS is zero. The VS is popped. 


JNO is a similar instruction but the VS is not popped. 


Jump Cne: J1 link 


Similar to JO but the VS must be one. JN1 is Similar 


to J1 but the VS is not popped. 


! tea = 
a Ieapes (SEA ain dead: tes: Bas vial 499 e ebuns :Z i 94 


7 ih c 


} - ; ra) : t 
ae oe aeteous izat woe baa. a 38) yeiaa ~ peas a7 DON ee 
: 7 a. =~ 
if) HOT OPA Ge reI2 5 ein tas sf inontomcens * 25RGe s2qe3 
: i /~T 
/ ayefaete ua x edi. ahem yak Reeth no bdnwaseee ietI0S 
; 


dtev BV odt of Soteod at dagabse edt fas Get air: 29 
i n 7 } us 


> 
@ ; : ar 7 

- Raii Gat :qaoG 

7. aide) 

acl sauces sds a he? nsuayoud 21 biett ianed my 


srspsza beapls, 


a 


a koe Ole Sipe. 
7 
da 


Vi tdi FOdoas sit YI He2HeCoteUe S|a! nist: ieee t 
' _ 5 
. Pores 
eggyoq =f “¥ aff | <O28e Sh, e4¥ Bis, 10 we ode 20 ewtey ai? 
md ¢ 
ay ee ao FOH Gi ial sho roast ouster <= astiete ak JuL 


ie stu va ok ‘al a 
sie sud 


70 


Reduction: RED link 

Fush an entry to the VS with tag RT. The value field 
of the VS entry is irrelevant. when one of the operands of 
OP is applied to the VS and a VS entry has the tag RT the 
Operation is not applied. The entry without the RT tag is 
used as a result. Effectively an RT entry is a place holder 
for dyadic operators. Once the reduction has started the RT 
entry will disappear. In the case where the expression, 
(+/10), is encountered the DM takes the appropriate action 
as (10) can not be handled by the EM. After pushing the 
entry to the VS a jump is made by incrementing LSREL by the 


value cf link. 


Mark and Iterate: MIT no operand 

This instruction adds entries to the IS. Scalar 
values, marked by the tag ST, are popped from the VS. The 
absolute value minus cne of each scalar is used to specify 
ISMAX in a new IS entry. ISCH is set to one and ISMRK to 
zero. If the VS value was positive then ISCTR and ISDIR are 
set to zero. If the value is negative then ISDIR is set to 
one and ISCTR to ISMAX. The value of ISMRK in the first 
entry created is changed to one. When an entry is popped 
from the VS and the tag is not ST then it must be SGT - a 
segment descriptor created by a SGV instnuction. |uiieuses 


the segment descriptor to push a new entry to the LS. The 
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LSIS is set to one so that the newly created IS will be 


iterated through and popped form the IS. 


Pop: POP no oferand 


The top element is popped from the VS. 


Initialize Segment Conditional: 
ISC org,len, mode ,MASK=00010 
ISC initializes to an SC instruction. The operands 
org, len, and mode describe an EM segment of code similar to 
the SGU instruction. The MASK will have only one non-zero 
bit. The position of this bit will determine the LINK 


value. 


Segment Conditional: SC org,len,mode, LINK=3 

The instructicn uses the LINK value to test the 
relevant ISCH bit. If the bit is one then an entry is 
pushed to the LS envoking the segment of code specified by 
the other operands. The LINK operand is also a relative 
backward displacement to a group of XT entries in pushing 
the entry to the LS the LINK operand is made absolute and 
specifies the LSQP field. The ISQP field will be used by 
the Index Unit to locate the XT entries. If ISCH is zero 
then the following instruction is inspected. If it is one 


of XS or XC the LSREL is incremented so the instruction will 
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not be executed. Aliso the LINK field of the XS or XC 

instruction is fcllowed to an XT entry. An XT entry is a 
pseudo-IS entry used for indexing subscripted arrays. The 
CH bit in the XT entry is set to zero indicating that the 


corresponding IS entry has ISCH set to zero. 


Initialize Index Load: IXL MASK=00001 
IXL initializes tc an XL instruction. The MASK is 
applied to the IS and an INX value generated in the same way 


an INX value is generated for an ICB. 


Index Load: XL INX=3 
The INX value is used to access an IS entry. The ISCTR 


field is pushed to the VS with the tag S1. 


Index Store: XS LINK=5 

The LINK field is a relative backward displacement to 
an XT entry in the QS. The VS is popped and its value 
pushed to the index field of the XT entry. XT entries are 


explained under subscripted arrays. 


Activate Segment: SG org,len,mode,LINK=3 
The operands cf SG are the same as those for SC. SG 
unconditionally pushes an entry to the LS similar to SC. 


The other functions done by SC are not done by SG. 
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The operation of the EM will be explained by examining the 


example below. 


The explanation will consist of two parts. 


In the first part, the simplest part, A and B will be one- 


element arrays. 


element arrays. 


In the second part A and B will be multi- 


Details relevant to indexing will be 


excluded from the first part and be the scle concern of the 


second. 


Address Operation 


300 
301 
302 
303 


304 


[TM Code 


IDS 


LDS 


LDNF 


ECWER 


LDS 


LDNF 


FOWER 


ADD 


FOWER 


LDN 


ASGN 


Operand 


25 
2 


Z+((Ax2)+(Bx2))*.5 


QSADDR 


0 


1 


2 


10 


why 


EM Ccde 


Cperation 


DEA 


POP 


Operands 

5) 

2 

@B, MASK=00111 
PWR 

2 
@A,MASK=00111 
PWR 

ADD 

PWR 
@Z,MASK=00111 


ASGN 
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Assume A and B to be one-element, three-dimensional 
arrays. The DM envokes the EM by pushing the following 
entry to the top of the LS; 

REGEG ORG SLENIVCDE® 2S FN NWT QP 

0 0 12 01 1 0 0 0 
This entry initiates execution of the EM code in the example 
above. The first three instructions of EM code push .5, 
then 2, and finally B to the top of the VS. In the process 
of being executed the IFA instruction for B has been changed 
to an FA instruction. This transformation facilitates 
indexing. Initialization instructions, such as IFA, do not 
cause the LSREL field to be incremented. Conseguently the 
FA instruction that replaces IFA is executed following the 
completion of the IFA instruction. It is the FA instruction 
which fetches the value of B. The following OP instruction 
pops the top two elements from the VS, performs the 
operation power, and pushes the result to the top of the 
stack. The next five instructions push values to the VS or 
perform the indicated arithmetic operation. The IA 
instruction is executed and changed tc A. Again, A is 
executed immediately after IA. The A instruction pushes the 
memory address of Z to the VS. The following OP instruction 
pops the VS and uses the address to store the value now on 
the top of the VS. The stored value is left on top of the 


VS as the result of the operation ASGN. The following 
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instruction, POP, clears the value off of the VS. Following 
the execution of POP the LS overflows; that is, LSREL equals 
LSLEN and the top entry is popped from the LS. The APLM 


then returns control to the DM and continues execution. 


For the second part assume A and EB are three- 
dimensional arrays with dimensions 3, 5, 11. To calculate 
the result the EM code is embedded in a set of nested loops. 
Each loop is an entry in the IS. For the above example the 
IS would be initialized by the DM to: 


CIRM AXe, DIRE UCHSSMRK 


0 z 0 1 1 
0 4 0 1 0 
TOPP=>7T0 10 0 1 0 


As in the first part when the EM code shown above has been 

completely executed once, the LS overflows. At this point 

the IS field of the LS register is tested. If LSIS is zero 

then the LS is popped as the segment of EM code described by 
this LS entry is not embedded in an IS loop. If LSIS is one 
then LSREL is set to zero preparatory to reactivation cf the 
segment of EM code. The IS is then stepped. To step the IS 
all the ISCH fields are zeroed. Then the CTR field of an IS 
entry is incremented by one and the CH field set to one. If 
the entry overflows - ISCTR equals ISMAX - then ISCTR is set 


to zero and the next lcwer entry in the stack stepped. The 
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stepping process starts with the entry at the top of the 
stack and continues down until an entry does not overflow or 
ISMEK is turned on in the entry. ISMRK marks the outermost 
loop. If this entry overflows then the IS overflows and the 
LS is popped. If, in stepping the IS, it does not overflow 
then the LS will cause the EM code to ke re-executed. 

Notice that the ISCTR fields could be used as arguments to 


the polyncemial access functions for the arrays A and B. 


If we use the ISCTR fields to evaluate the polyncnmial 
access function in the above example we will find the 
difference between any two successive evaluations of the 
polynomial to be one. This suggests incrementation as a 
Simpler and more efficient method of evaluating the 
polyncmial. Incrementing by one is sufficient to index 
deferred expressions but not for beaten expressions. A more 
general method is evident in the execution of EM IFA 


instructions. 


The instruction of QSADDR 2 in the example above is: 
IFA @B,MASK=00111 
The @B operand is a pointer to the descriptor for B. MASK 
is a logical vector. Although only five of its values are 
shown there should be as many as the highest rank the 
implementation supports - generally 32. The one values 


select entries from the IS that are involved in indexing the 
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array B. In this example it selects the top three IS 
entries. MASK performs a useful function when the arrays 
are operands of the Outer Product operation. An IFA 
instruction is only executed once. In the process of 
executing, IFA the opcode and operands are changed. The 
instruction becomes: 

FA LINK=10,VBASE=700, SUM=164 
The link field is a relative displacement to an iteration 
control block (ICB) pushed to the top cf the QS. VBASE is 
the memory address of the values of B. SUM is the 
displacement from VBASE to the value indexed. All these 
fields are set up by the execution of the IFA instruction. 
The values for VBASE and the ICB were obtained from the MASK 
and the descriptor for B. The ICB contains values which are 
added to or subtracted from SUM. Using the ICB we can 
implement the more generalized incrementing function 


mentioned above. The ICB entries contain the following 


values: 
QCSADDR Operation Operands 
We NT Q1=110,Q2=55,INX=0 
13 NT Q1= 44,Q2=11,INX=1 
14 NLT Q1= 10,Q2= 1,1INX=2 


NT and NLT are not opcodes but rather tags which identify 
the ICE entries as being associated with the IS. The 


alternative is QT and QLIT tags that are used for subscripted 
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artays which will be explained later. The INX operands 
index the Nth entry from the bottom of the IS. Thus INX=2 
would index the entry marked TOP in the IS shown above and 
INX is relative to the bottom of the IS as the displacement 
from the bottom to any given entry remains constant. As 
entries may be pushed to the top of the IS after an INX 
value is set it must be relative to the bottom of the IS. 
INX=0 indexes the entry with ISMRK set to one. The (2 
operands are the coefficients of the polynomial access 
function. The Q1 cperand is the product of Q2 and the ISMAX 
field selected by using the INX operand. Q1 and Q2 have 
more intuitively understandable interpretations. Q2 is the 
increment in the displacement needed tc access a value whose 
indices differ from a previous value's by one in oniy one of 
the dimensions. Fer example the increment in displacement 
between BL[0;0;0] and B[0;0;1] is 1, between BL 0;0;0] and 

BL 0;1;0] is 11, and between B[0;0;0} and BL 1;0;0] is 55. Q2 
is the decrement in displacement needed when the indices 
differ in only one dimension but take on their extreme 
values in that dimension. For example the decrement in 
displacement between B[0;0;10] and B[0;0;0] is 10, between 
BE 03430} and BL 0;0;0] is 44, between B{2;0;0] and B[L0;0;0] 
is 110, and between BL 2;4;10] and B[0;0;0] is 110 + 44 + 10 
= 164. Notice that the operand SUM in the FA instruction 


above has an initial value of 164. The purpose of this 
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initialization will become apparent in the explanation of 


the FA instruction below. 


All EM instructions with mnemonics that start with an 
"I" such as IFA, are initialization instructions. When 
executed they initialize an associated instruction and 
Suppress the incrementation of the LS. Thus the next 


instruction executed is the newly initialized instruction. 


When the FA instruction is executed it invokes the 
Index Unit. The Index Unit alters the value of SUM and 
returns. The FA instruction then adds VBASE and SUM to 
generate a memory address from which it fetches a value. 


This value is then pushed to the top of the VS. 


The Index Unit used the LINK operand to access the ICB. 
It uses the INX operand of the first entry - QSADDR 12 - to 
index an entry of the IS. As ISCH is one in this entry, the 
Index Unit will use the ICB entry to increment or decrement 
SUM. As the IS entry appears to have overflowed, ISCTR is 
zero, ©1 is subtracted from SUM. The Index Unit continues 
with the other ICB entries in a Similar manner. On 
termination it has decremented SUM to zere. Before this FA 
instruction is executed again the LS will have overflowed 
and the IS stepped. In stepping the IS all the ISCH entries 


will have been zeroed. The IS entry at the top of the stack 
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will then be stepped and only this ISCH field reset to one. 
Thus the Index Unit will do nothing with first two ICB 
entries as ISCH is zero. The third entry, for which ISCH is 
one and ISCTR is not zero, will cause the incrementation of 


SUM by Q2. 


A question remains as to why SUM was initialized to 164 
rather than zero. By initializing in this fashion the logic 
to handle subscripted arrays is simpler. For the array 
AL 11;I1Z2]}] the values of I1 or I2 are fetched if the 
associated ISCH is one. By initializing all the ISCH fields 
to one the indices are fetched on the first pass through the 
EM code by the Index Unit. Thus the legic to fetch indices 
in the Index Unit need not be duplicated in the IFA 


instruction. 


4.4.1 Subscripted Arrays 


The handling cf subscripted arrays by the EM is 
intended to parallel as closely as possible that of non- 
subscripted arrays. The array accessing mechanism will have 
to be augmented so that the indices are used aS arguments to 
the polynomial access function rather than the values of 
ISCTR. However the IS still remains as an integral part of 


the accessing mechanism. Its function is illustrated with 


the aid of the following expression: 
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T+ ALB(C(Is]3)3] 
Assume that T 1s not a scalar or one-element array and that 
the dinensiongof Teis¥ijtof-Ckhis 243 7e0fe8 is#4> SaeandsoGna 
is 6 7. If the add operation has conformable operands then 
fT must have dimension 13 5 7. The values of T would be 


fetched using the IS which would have the following entries: 


Do tseeGLR eM AXx | DIR CH MRK ENTRY NO. 
0 0 C 0 1 1 
0 2 0 0 0 & 
0 u 0 0 0 5 
C 6 C 0 0 4 


Note: The Entry Number shown is for ease of reference in 
the text below and is not part of the IS. 

Notice that the values of I can be accessed using the first 
IS entry and that when this entry overflcws we will have 
fetched all the values of I necessary. The IS can not be 
used to access C[{1I;] directly but the dimension of C{I;] is 
1 3 which is the same as the first two entries of the IS. 
Thus when the first two entries of the IS overflow all the 
values of C{I;] will have been fetched. When the first 
three entries overflow B[C[I; ];] will have been fetched and 
when all four entries cverflow AL B[C{L1I;];];] will have been 


fetched. The IS will perform two functions in the accessing 
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1. When the relevant ISCH field is one a new value will 
be fetched frem a subscript. 
2. When all the values of a subscript have been fetched 


the relevant IS entry will overflow. 


Thestunctaon sor the TSCTR tandweiSMAx = ti1eldssof shemis 
are replaced by XT or pseudo-IS entries in the QS for 
subscripted arrays. XT entries have the following fields: 

xT index, limit,change, GN=3 
The first three operands parallel the ISCTR, ISMAX and ISCH 
fields of the IS. Group Number (GN) appears only in the 
first entry in a group of entries and determines the number 
of entries in the group. XT is used in conjunction with the 
ICB to access array elements. When the Index Unit (IU) is 
invoked it finds that the LSQP field is non-zero. The IU 
uses LSQP as a pointer to the XT entries. The IU accesses 
the first ICB entry and uses the INX value to select the 
relevant XT entry. If the change field of the XT entry is 
zero the IU goes onto the next ICB entry. Otherwise it 
subtracts from the operand SUM in the FA instruction which 
invoked the IU the Q1 value. It then adds to SUM the 
product of Q2 and the index field of the Xf entry. MThis 
product is then used to respecify Q1. Consider what occurs 
when the IU is first invoked for an FA instruction. SUM has 


been initialized to the displacement of the last element of 
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the array. Subtracting the Q1 values will decrement SUM to 
zero. Adding the product increments SUM to element 

specified by the indicy. Replacing Q1 by the product is the 
Simplest way of having SUM reset to zero next time the IU is 
invoked. The IU frocesses the remaining entries in the ICB 


in a similar fashion. 


EM instructions are used to alter the values of index 
and change in Xf entries. The two pairs of instructions 
used are: 

1. sc org,len,mode, MASK=00001 
Ao INX=4 
Zea LX L MASK=00010 
xs INX=4 
The first pair is used for fetching and storing subscripts 
in XT entires. The second pair is used to retrieving ISCTR 
and storing it in xf. This will occur when there is an 
empty subscript such as the first dimension in the 
expression A{I;]. An ISC instruction is used rather than 
IFA as the subscripts may be expressions rather than simple 
operands. In addition ISC provides for the specifying of 


the LSQF so that the IU will function correctly. 


Referring to the expression above, A{B[C{1I;];];], the 


EM code to fetch the values of I is: 
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CSADDR Operation Operands 
1 IFA @1,MASK=00001 
The EM code to fetch the vaiues of C[{I;] is: 
QSADDE Operation Operands 


* Code to fetch the value of I 


2 IFA @C,MASK=00011 

3 XT 0,1,1,GN=2 

i XT Oe nt 

S ISC 4,1,1,MASK=00001 
6 ps) LINK=3 

7 IXL MASK=00010 

g XS LINK=4 

S) SG 7,1,1,LINK=6 


To invoke this segment of code an entry is pushed to the LS 
which defines QS addresses 5 through 9. The ISC instruction 
at QSALDR 5 causes the IFA instruction at QSADDR 1 to be 
executed. The follcwing XS instruction stores the value of 
I in the XT entry at QSADDR 3. Notice that the index fields 
of the XT entries are zero prior to XS instructions being 
executed. The following IXL, XS pair push the value of 
ISCTR to the XT entry at QSADDR 4. The SG instruction at 
QSADDR 9 executes the IFA instruction at QSADDR 2 anda 
value of C is pushed to the VS. The EM code to fetch the 


values of B[C{L1I;];]} is: 
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QSADDR Operation 


* 


10 


As 


12 


13 


14 


LS 


16 


tie 


Operands 


Codestortetch #cpr:4 


IFA 


XT 


XT 


SG 


@B,MASK=00111 
0,3,1,GN=2 

0,4,1 
8,5,1,MASK=00001 
LINK=3 
MASK=00100 
LINK=4 


7,1,1,LINK=6 


85 


This segment of code executes similar to the above segment. 


However, 


segment of code at QS addresses 5 through 9. 


the ISC instruction at QSADDR 13 invokes the 


fetch the values cf A[{B({C[1I; ];];] is: 


QSADDR Operation 


0 


* 


JMP 


Operand 


21 


Code to fetch BL C[ I; }; ] 


IFA 


XT 


XT 


XS 


SG 


@A,MASK=01111 
0,5,1,GN=2 

0,6,1 

8,5, 1,MASK=00001 
LINK=3 
MASK=01000 
LINK=4 


1, 1,1, LINK=6 


The EM code to 
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The LS entry which invoked this segment has LSORG set to 
zero and LSLEN set to 25. The LSIS bit is one, thus this 
segment will be repeated until the IS overflows. The first 
dimension of A[B(C{I;];];] is 7. Thus the instructions at 
QS addresses 0, 21, 22, 23, 24, 25 are executed 7 times 


before the IS overflows. 


4.4.2 Reduction 


An expression ccntaining a Reduction operator produces 
a result of rank cne less than the operand of the Reduction. 
The DM jnitializes the IS to represent the rank and 
dimensions of the result. Thus, in order to use the IS to 
access the operand of Reduction, instructions are inserted 
in the EM program segment to push an entry to the IS. This 
is illustrated by the following example: 


APL Expression: A-+/B 


OSADDR Operation Operands 
0 RED LINK=3 
les IFA @B,MASK=00111 
2 OP ADD 
a SGV gig are 
4 S me 
5 MIT 


6 ITED @TEMP, MASK=00011 
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7 IFA @A,MASK=00011 

8 OP SUB 
Assume A has dimensions 2 3 and B 2 3 5. The LS entry for 
this segment invokes the instructions at QS addresses 0 
through 8. The first instruction pushes an entry to the VS 
with tag RI and branches to QSADDR 3. The SGV instruction 
pushes a segment descriptor for QS addresses 1 and 2 to the 
VS with tag SGT. The following instructicns pushes value 
minus five to the VS with tag ST. Five is the value of the 
third co-ordinate in the dimension of B. The minus sign 
will indicate to the following MIT instruction that the 
ISDIR field is to be one and consequently the reduction 
process will decrement along the third co-ordinate of B. 
The MIT instructicn pops all entries with tag ST from the 
VS. As it pops these values it uses them to specify ISMAX 
in IS entries it is constructing. MIT then pops the SGT 
entry from the VS and uses it to push an entry to the LS. 
The new LS entry under IS control iterates through the 
instructions at QS addresses 1 and 2 five times and is then 
popped as the IS overflows. This calculates one scalar 
component of +/B. This scalar component is then subtracted 
from a scalar component of A. The LS iterates through the 
QS addresses 0 threugh 8 six times before the IS overflows 


and A-+/EB is calculated. 
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In the more general case where an index is included in 
the Reduction expression such as +/[K]B the DM constructs EM 
code to calculate a different but equivalent expression. If 
B is a rank 4 array then the expressions telow are 


equivalent: 


+/(1I28 is equivalent to WY (4 2 2 38 
+/C2I1B is equivalent to +/(1 4 2 3)a&B 
+/C3]B is equivalent to +/(1 2 4 3)Q&B 
+/C4IB is equivalent to +/(1 2 3 4)&B 
+/C4IJB is equivalent to +/B 


In general +/([KJB is equivalent to +/A&B where A is: 
(1K-1),0F/i1pp8),((K-1) +1 (pp )-K) 
As the Iranspose cprerator is beaten by the DM the general 


case is equivalent tc the particular case illustrated above. 


An expression containing an Cuter Product produces a 
result of greater rank than either of the operands. Like 
Reduction the IS is initialized to represent the rank and 
dimension of the result. As the dimension of the result is 
the catenation of the dimensions of the operands the entries 
in the IS can be used directly to fetch the operands. Ali 
that is needed is to alter the masks of the IFA instructions 


that fetch the operands. Consider the fcllowing example: 
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QSADDE 
0 
1 
2 


3 


m 


5) 


A+Bo,xC 

Operation Operands 
IFA @c , MASK=00011 
IFA @B,MASK=01100 
OP MULT 
IA @A,MASK=01111 
ASGN 
PCP 


Assume the dimension of B is 3 5 and C is 7 9. The 


dimension of A is then 3 579. The IS will contain: 


153. CTR MAX 


0 
0 
0 
0 


2 


4 


6 


8 


DIR 
C 
0 
0 
C 


CH 


0 
0 
0 
0 


MRK ENTRY NO 
1 1 
0 2 
0 3 
0 4 
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The MASK for array C selects IS entries 3 and 4 to fetch the 


values of C. 
C thus selecting entries 1 and 2. 


logical or of the masks of the operands and thus selects 


The MASK for B is shifted left by the rank 


the entries of the IS. 


times and each value of B is fetched x/pC times, 


operands are expressions they are calculated to temporary 


The MASK for A is the 


of 


all 


As each value of C is fetched x/pB 


space before the Outer Product is deferred. 


if the 


To calculate Inner Products the DM constructs EM code 
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to calculate a different but equivalent expression. If the 
original expression is Bt+.xC then the equivalent expresion 
is +/AQBo,xC where A is the following: 

(1 1+p9B),(2pf/r1(ppB)+(ppC)-1),( 1+ppB) +t 1+ppc 
For example if the dimension of B is 2 3 4 and C is 4 5 6 
then the DM generates the expression: 

4/((1 2),€5 5),(3 4))QBo,xC 
This expression is beaten then handled in the same manner as 


the Reduction expressicns discussed above. 


The purpose of the APLM is to achieve efficiency in the 
execution of APL expressions by beating and defferal. 
Beating reduces the number of operations required to 
evaluate an expressicn. Deferral makes mcre efficient use 
of low speed core memory. To achieve these ends the 
hardware design is made to closely resemble the design of 


the language. 
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CHAPTER V 


This Chapter reviews the main points of the previous 


Chapters and analyzes the result. 


Iverson{ 14] originally designed APL to be a language 
for the description and communication cf algorithms. In 
implementing APL the I/O facility, tree structures, and many 
of the operators were omitted. In addition much of the 
notation and character set were altered. Despite these 
omissions the resulting language is a highly consistent, 
mathematically oriented language. However there are faults. 
Subscripting was net fully generalized to arrays and the 
syntax is context dependent. Subscripting can be 
generalized to arrays by the addition cf a mixed-data type. 
The mixed-data type, being similar to parameter lists in 
other languages, will also allow more than two operands to 
be explicitly passed to a function. Operands were not 
passed to functions by the same mechanism as operands are 
passed to primitive operators. By passing operands to 
functions by name we can make consistent the facility of 
functions augmenting the primitive functicn set. The Branch 
operator was deleted because its function could better 


served by the control structures IF, THEN, ELSE and REPEAT, 


af mo boa +éndeeeeneas tetas [er — on 
a © a _ 
2 -_- 1 rt BOV 5 1 qj it Mea eold |220896 ed 0% 


HUST: ads. iS pobeaeig 


: bat (agro stoniys oad "fy zee 


iu fu! acre hbba at eer 2 3eW s10*s 3890 
ie 


ries? .towazis e¥sv #2e seib nse > bas’ 
; s ¥ ; ~~ ® ae 
: yo vivords Feat Macy f paiehi 2 wit Qnoke + anil 
Je ae ~ 
eis] pau sjaht sp yewor 1 ete isokteaedtea 


Live sits ps Bascis Mahe Pos Joan 8 std Sere 
cede 2 ds ey aan 2402 


.  Weo bedty troedae Sbpea es 2eesn0s as ; 
eal a om ; ¥ rss es ts ir 


rl : t - - 
.eits (/PiSi-feanig 6.30 aukaey 1 ent Yo. EY6235 O27 Hots S166 
‘ ; é pi i” a 


JOTeI : ry 4 SGA Reo eased e7at7 sptob-bs 7 ait 


tS ‘Sbirszey6 ov). cet! o4dow, Wolis oats Lie tala 


Joqverev ahnsia) ~ .qditonvs vii oF hain 24 ind 
S36 Gimeiegé = Windooe abe. add co a Leste 4 Gen 
7 


at —: jo pat aah yc -p1atez9q0 3% e aes ot) 


+ 
D3SS56% 
a a 7 


— 7 @ oe ot 
od wuabton 2.805 sav ealbazaon - oxnal i439 oes rare aa OLFD) L 
aan cain ; 
ives si SGP .*6e Nats 2qu% Bye rat Pas a 


ar atsol biuos | wort jouw) ait: 
ay pa ; 

. abTe Wit hy Bs, a 26 a4 Bey i iv 
: - — a ; _ 


we 


7 _ - 


vr, 


92 


END. The new syntax adheres to Iverson's criteria of 
clearly exhibiting the constraints on the seguence in which 
Operations are performed. In addition it implements the 
method of leading decisions and allows no alternatives. The 
operators Encode, Decode, Index Generator and Index Of are 
extended to arrays from their previously restricted 


definitions. 


A new operator, Rename, and the addition of an I/0 
facility were discussed in Chapter III. Rename was added to 
allow variables, especially “parameter lists" or variables 
of type mixed-data, to be renamed and manipulated without 
the necessity of duplicating the arrays involved. The 
facility provided ky Rename is consistent with the facility 
provided by functicn invocation - that of values being known 
by another name. An IyO facility for sequential files was 
added to increase the usefulness of the language. It 
satisfies Iverson's criteria that it be concise, consistent, 
mnemonic and economical of symbols to a greater degree than 
the ad hoc implementing of I/O functions and operators. The 
I/O facility was designed to be reasonably simple and 
straight-forward to implement. It does not provide the full 
complement of facilities such as indexed or shared files 


that would be useful. 


Improvements to the implementation consist of 
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modifications to Akrams proposal for an APL machine. 

Abrams" AFIM is a stack oriented machine divided into three 
parts. The CM ccnverts APL text to a more readily 
interpreted DM code. DM code is in an opcode-operand form 
that resembles the machine code of mary computers. The DM 
interprets the DM code, analyzing it into deferrable and 
beatable expressicns. The results of the analyses is 
executable EM code. The EM code has well defined operations 


and operands and is executed directly. 


A subset of the EM was implemented in IBM/360 Assembler 
Langauge. The subset worked and demcnstrated that the 


fundamental design of Abrams' machine is valid. 


The advantages of the APLM are the following: 

1. DM code can ke interpreted efficiently. 

2. Deferred expressions reduce the number of references 
to intermediate speed storage. 

3. Beating reduces the number cof computations necessary 
to evaluate an expression. 

4. The stack discipline of the AFIM organizes 
frequently used data into the high speed storage or 
stacks. 

The fourth point needs some qualification. If the high 
speed storage is an crder of magnitude faster than 


intermediate speed storage then the stack discipline is 
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Clearly advantageous. However the advent of scratch-pad 
memories and other technological advances reduces the 
advantage of the AFLM. In the simulation of the APLM on a 
machine with no stack hardware deferred expressions are a 
disadvantage, unless beating occurs to reduce the number of 
calculations so as to gain an overall advantage. Some of 
the disadvantages cf the APLM are: 

1. The generation of a precise error indication for 
errors such as division by zero in a deferred 
expression is an involved procedure. 

2. The number and size of the various registers makes 
task switching in a multi-programming environment 
unattractive. 

3. In order tc gain the advantages mentioned above the 
IS, LS, QS, and VS should be implemented in high 
speed register storage which may be costly. In 
addition the logic of the DM and EM would be 
microprogrammed using a considerable amount of 
storage. 

The overall advantage of the APLM is dependent on changing 
technology. The introduction of scratch-pad memories to 
conventional implementations may reduce the advantage of the 


APLM to the point where its extra cost is unwarranted. 
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Appendix A 


Summary of APL Operators 


35 


Pd 7 os a 


Definition Definition 
or example or example 


+B +> O+B Plus + | Plus 24:9 2) + Sin 


-B Negative Minus Qa aie roel 2 


Signum Times 2%3./2 ++ 6.4 


Reciprocal + | Divice 243.28 +> 50,6025 
Ceiling Maximum Sev) ere 7 
Floor Minimum a7 e473 

*B «> (2.71828..)*B| Exponential Power 2*3 «> 8 


@xV ++ N ++ x@N Natural Logarithm A@®B ++ Log B base A 
logarithm A@B ++ (@B)+@A 


}~3.14 ++ 3.14 Magnitude Residue 


A=0,B<0| Domain error 


al Factorial ' | Binomial A'B <> (!B)t(!4)x!B-A 
Bx!B-1 coefficient /2!5 «+ 10 3!5 ++ 10 
++ Gamma(B+1) 


Random choice ? | Deal A Mixed Function (See 
from 1B Table 3.8) 


Bx3.14159...0 | Circular See Table at left 


0 AQ) xz 


And A AAB|AVB|AKBi AB 
Or 0 at 
(1-Bx2)*.5 (1-Bx2)*.5 Nand 0 0 
Arcsin B Sine B Nor 1 0 
Arccos B Cosine B 7 0 
Arctan B Tangent B 
(~1+Bx2)*%.5 (1+Bx2)*.5 Less Relations 
Arcsinh B Sinh B Not greater Result is 1 if the 
Arccosh B Cosh B Equal relation holds, 0 
Arctanh B Tanh B Not less if it does not; 


Greater SHES 7/ sees Gl 
Table of Dyadic o Functions Not Equal 7s'3 +> 0 
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Conformability Definition 
oA oAf.gB | requirements Z+A£.gB 
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APL Inner Products 


Definition 
Z+A°.gGB 


Z+AGB 
ZCIIJ<AgGBCI) 
ZCIIJ<ACIIgGB 

Z2CI:JIJ+ACIIgBC(J) 
Z(I3;J)+AgB(I;J) 
ZLI3;JIJ+ACLI;JIgB 

ZOI:J3KI*ACIIGBCI;K) 

ZUOIsd3KIJ*+ACI;3JIgGB(K] 

WIZCI3sJ73;K3;LI+ACI;JIGBCK;L] 
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Definition or example? 


Size pP ++ 4 pF «> 3 4 e5 «> 10 


Reshape Reshape A to dimension V opi 128 «ae FE, 
AAelj si ial Ope +> 10 


»4 ++ (x/pA)pA »E +7112 p,5 +> 1 
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EC1;) «+ 1234 ABCD 
9 "ABCDEFGHIJKL'(E] ++ EFGH 


Index 
generators 


Web sese BY Ale eh) 
10 ++ an empty vector 


First S integers 


Index of3 Least index of A Pi3 +2 581 25 
an Vi, OL 1t+5V PiE «++ 3 5 4 5 
4 4Y4 <> 1 Soo 
Take or drop |V{I] first 2 3tX ++ ABC 
bt (VCI]20) or last (V{£I]<0) EFG 

Drop V+A elements of coordinate I "24P «+ 5 7 


) The permutation which N35 ceo ieee 
would order A (ascend=- 


ing or descending) 


Grade aa MA 
Grade down*5/V4 


| Pee Wal iS reine rg) BE 


Compress 101 0/P ++ 2 5 10120 /F <> 5 77 
ake 
fen / 2 1b e+e 25 ee Ge se nO al 2 
PW Heh be 416) 
A BCD 
feo N12 +> 1) 02 1 Ose tA ee BPG 
I_JKL 


DCBA Pee 
Reverse® OX «+ HGFE OC1]X ++ OX ++ EFGH 
LKJI oP «+ 75 3 2 ABCD 

BCDA 

Rotate® 30P ++ 72 3 5 ++ ~16P 10 ~1bX ++ EFGH 


LIJK 
AETI 
Coordinate I of A 2 1QX ++ BFJ 
becomes coordinate CGK 
V(IJ of result 1 1QF ++ 1 6 11 DHL 


Transpose 


Transpose last two coordinates QE «> 2 12 


oy ata 
Membership pWeY «+> oW EeP +> 1 ° i @ 
2a eee Sh ate) 0) fe) OmO 
Decode AON GY yf Ts) reo BLA YAS: PAS {e}(0) fey) i al = S) Sea Speeds 
Encode VTS 2* ‘AO, HOurely es) seco al 2s (SiO) (SOP CARS) Sea tS) 
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APL Mixed Functions 
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Restrictions on argument ranks are indicated by: S for 
Scalar, Vo storavectors;e for “matrix, 4 for Any. Except as 
the first argument of S14 or S[A], a scalar may be used 
instead of a vector. A one-element array may replace any 
scalar. 


Arrays used 4 ABCD 
in examples: Pecos 2. 3 95.77 E 8 X +> EFCH 

@) ak@) kal bz IJSKL 
Function depends on index origin. 


Elision of any index selects all along that coordinate. 


Themserunction | is applied along Gthe “Last coordinate; the 
Symbols’) 7, <, "and "e “are equivalent "to "Fs =\e=eands 0, 
respectively, .except thatthe) function is applied alongitne 
first coordinate. If [5] appears after any of the symbols, 
the relevant coordinate is determined by the scalar 5S. 


Notes for Mixed Functions 
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